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PREFACE 


Great advances in particle physics have been attained in the last several decades. In 
the development of the new generation of accelerators and detectors, the quantity 
and quality of the data on various interaction processes have been vastly increased 
and our knowledge of particle physics became surprisingly fruitful. Moreover, many 
interesting and new ideas were proposed and developed successfully in quantum field 
theories for describing strong, electromagnetic and weak interactions. Those new 
ideas were concentrated into a simple and beautiful theory known as the standard 
model in the late 1960’s which was formulated in the framework of non-Abelian 
gauge theory and was extremely successful in describing a wide range of existing 
phenomena of elementary particles. However, it is also believed that the model is not 
the ultimate theory; it has many arbitrary parameters which cannot be predicted by 
the theory and fundamental problems, such as the hierarchy problem, which should 
be solved in some physics beyond the standard model, “New Physics”. Thus it is a 
general consensus that to search for the evidences for physics beyond the standard 
model is the most urgent issue in the particle physics of the 21st century. 

The aim of the present textbook is to give a unified description of the struc¬ 
tures and interactions of elementary particles, by discussing the underlying theo¬ 
ries, namely the standard model of elementary particles and physics beyond the 
standard model. Especially, concerning the description of the physics beyond the 
standard model, we will select the hot topics of our current interest, including the 
issues inspired by various experiments as much as possible. 

This book has emerged out of the introductory lectures we gave in the graduate 
course or series of lectures given in various places over several years. Therefore, 
this book is written primarily for graduate students and can be used by those who 
wish to major in theoretical and experimental high energy physics. Throughout this 
textbook, it is assumed that the readers have already gone through a basic course 
in quantum field theory and particle physics. We hope that this book is also useful 
for the researchers in the field of particle and nuclear physics. 

There are two main features of this textbook. First, we have aimed at a concise 
description but at the same time we have paid attention so that the basic concepts 
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are clearly mentioned. For such a purpose each section is guided by pedagogical 
arguments. Second, we also attempt to provide excitement in particle physics, 
focusing on the important experimental observations (old and new) and a variety 
of nice ideas for their interpretation. 

The organization of the book is as follows. In Chapter 1, we give a brief in¬ 
troduction of the standard model for the elementary particles to describe their 
structures and interactions. This is followed by Chapter 2 with a description of 
Fermi theory of weak interactions and its limitation. Preparation for the under¬ 
standing of electroweak gauge theory is done in Chapter 3, which covers symmetry, 
symmetry breaking and the gauge theory. Glashow-Weinberg-Salam (GWS) the¬ 
ory of electroweak interactions is covered in Chapter 4. In Chapter 5, quantum 
chromodynamics (QCD), i.e. the theory of strong interactions, is discussed briefly. 
After these chapters, we will come to the description of physics beyond the standard 
model. In Chapter 6, we discuss a current hot topic, neutrino physics, i.e. neutrino 
masses and neutrino ocsillations. Then, in Chapter 7, we deal with supersymmetric 
theories as the typical example of physics beyond the standard model. We further 
attempt to discuss other representative topics in the physics beyond the standard 
model; precision tests of electroweak radiative corrections and new physics search 
through these analyses in Chapter 8 and flavor physics and CP violation, which 
are also sensitive to the presence of new physics, in Chapter 9. Each chapter is 
accompanied by a few problems, hoping that solving them will be of some help in 
the understanding of the main text. The topics we chose depend on our preference. 
However, we believe that those topics must be of general interest and instructive 
not only for students who are going to enter the field of theoretical and experimen¬ 
tal particle physics but also even for researchers in high energy physics. Finally, 
appendices for some basics are given for the reader’s convenience. 

During this project, we have learned a lot from many excellent books and re¬ 
views; some of them (Bjorken and Drell, 1964; Abers and Lee, 1973; Jauneau, 1977; 
Quigg, 1983; Cheng and Li, 1984; Halzen and Martin, 1984; Rudaz, 1986; Aitchi- 
son and Hey, 1989; Wess and Bagger, 1992; Nagashima, 1999) were good guides in 
preparing this book. 

Unfortunately, because of limited space and time, we could not help neglecting 
some of the important topics, such as the full contents of QCD, discussion on the 
grand unified theories (GUT), detailed phenomenological analyses of supersymmet¬ 
ric theories, and so on. Fortunately, concerning these topics, there already exist 
many excellent textbooks or reviews and we refer the reader to some of them: for 
example, for comprehensive discussion of QCD, the textbooks by Yndurain (Yn- 
durain, 1992) and by Muta (Muta, 1998), for discussion of GUT, the book by Ross 
(Ross, 1985) and a review by Langacker (Langacker, 1982), for more details on su- 
persymmtry including its phenomenology, the books by Weinberg (Weinberg, 2000) 
and by Ross (Ross, 1985), and so on. Furthermore, in this textbook the references 
are far from complete but rather limited only to the ones directly related to the 
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discussions extended in each chapter. We apologize to all authors whose papers we 
have neglected in spite of their great contributions to the fields. 

In carrying out this project, we are deeply indebted to many people, including 
our teachers, collaborators, students and colleagues who taught and encouraged 
us a lot. One of the authors (S. N. M.) is grateful to Professor Amitava Roy 
Chaudhuri for useful comments and to Dr. Sudip Sanyal for research collaboration. 
The financial assistance obtained from the Department of Science and Technology, 
New Delhi, India, is also gratefully acknowledged. It took a long time to complete 
this project; we started at the end of 1993 and finally came to an end in the fall of 
2003 after a long struggle. The project was interrupted many times by unexpected 
difficulties which we came upon in the course of this project, such as the terrible 
earthquake in Kobe in 1995, the passing away of the mother of one of us (T. M.) in 
2002, and so on. In spite of the terrible delay of our work, H. T. Leong, editor of this 
book, has encouraged us constantly with a warm heart and it is our great pleasure 
to sincerely thank him for his patient and continuous support for publication of this 
book. It is also our great pleasure to acknowledge the aid of our graduate students, 
S. Oyama, K. Mawatari, K. Sudoh, K. Hasegawa and T. Nagasawa, for their self- 
sacrificing help in drawing and arranging figures and for useful comments. Finally, 
we are greatly indebted to our families for their warm support and encouragement 
throughout this long time. 


T. Morii, C. S. Lim and S. N. Mukherjee 


Kobe, Japan and Vanarasi, India 
November 2003 
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Chapter 1 

INTRODUCTION 


“From the earliest times, man’s dream has been to comprehend the complexity of 
nature in terms of as few unifying concepts as possible”* 

Stars which are twinkling in a night sky have brought us various dreams from 
ancient times. People have been attracted in their mysterious appearance and have 
looked for the physical law of the cosmos; how was the cosmos born?, what are the 
most fundamental building blocks of matter constructing the cosmos?, how do those 
fundamental particles interact with each other or one another? and so on. There 
have been many questions to be answered. Particle physics is the most powerful 
tool for investigating these fundamental questions. 

In this introductory chapter, we make a simple explanation on fundamental 
particles in the standard model and on the characteristic properties of their inter¬ 
actions. 


1.1 Elementary particles in the standard model 

Elementary particles are the most fundamental building blocks of matter. Entity of 
elementary particles has been changed in the long course of development of physics. 
By the end of the last century, people have found that matter is made of molecules 
and/or atoms, atoms are made of nuclei and electrons, nuclei are made of nucleons, 
i.e. protons and neutrons, and finally nucleons are made of quarks. Elementary 
particles of our own time are quarks (together with leptons such as electron). In 
the development of big accelerators in the last 50 years, many (more than 300) 
particles called hadrons were discovered in addition to protons, neutrons and pions. 
There are two kinds of hadrons; baryons (proton, neutron etc.) with the baryon 
number B = 1 and mesons (pions, kaons, etc.) with B = 0. 

The quark model proposed by Gell-Mann and independently by Zweig in 1964 

‘Address by Abdus Salam at UNESCO Celebration of the Centenary of Einstein’s birth, Paris 
May 1979. 
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(Gell-Mann, 1964; Zweig, 1964) classifies all existing hadrons surprisingly well based 
on the internal symmetry of SU (3) for hadrons composed of relatively light three 
quarks, and successfully explains static properties of those particles (see Appendix 
E). In the quark model, baryons are composed of three quarks (qqq ) as p = ( uud ), 
n = ( udd ), A = ( uds ), etc. and mesons are composed of a quark q and an anti-quark 
q as 7 r + = (ud), — (ud ), K + = (us), K~ — ( su ), etc. Nowadays six different 

quarks u, d, s, c, b, t are known to exist and thus, it is said that quarks possess 6 
degrees of freedom called “flavor”. A quark flavor can change into another quark 
flavor through weak interactions mediated by charged weak bosons W ± , which are 
predicted to appear in the electroweak standard model as discussed later in Chapter 
4. In addition to flavor, quarks have another degree of freedom called “color” as 
briefly discussed in Chapter 5. The interaction between quarks due to the color 
“charge”, which is nothing but the strong interaction, is mediated by gluons and is 
described by quantum chromodynamics(QCD). QCD is the gauge theory with color 
SU (3) symmetry. While the flavor symmetry is broken by the difference of quark 
masses, largely for heavy quarks, the color symmetry is an exact symmetry. 

Another kind of elementary particles called leptons (electron e, muon /j ,, tau 
r and their corresponding neutrinos v e , v^, v T ) exist in Nature. Leptons are free 
from the strong interaction and have no color degrees of freedom, i.e. leptons are 
colorless. Among them, neutrinos possess only weak interactions, while e, /i and t 
have the both of weak and electromagnetic interactions. Productions and decays 
of leptons are successfully described by the electroweak standard model, i.e. the 
SU(2)l x U(1)y gauge theory of electroweak interactions, discussed in chapter 4. 

The elementary particles in the standard model are as follows; 


Quarks (d)’ (»)' (0* (u) 

Leptons ( "* ) , ( . ( * ) , (1-2) 

{ photon 7 , 

weak (gauge) bosons W^,Z 0 , (1.3) 

gluons g, 

Higgs bosons H. (1.4) 


Quarks and leptons are fundamental building blocks of matter. All of them are 
fermions and have spin |, whose quantum numbers are summarized in Table 1.1 
and Table 1.2. It is interesting to note that both quarks and leptons are paired into 
three doublets, the members of each doublet participating in the charged current 
weak interaction processes together. The repetition of the doublets is stated as there 
are three generations of quarks and leptons. The corresponding particles in different 
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Q 

h 

s 

c 

B 

T 

mass 

u 

+ 1 
' 3 

+k 

0 

0 

0 

0 

1.5 ~ 5 MeV 

d 

1 

3 

1 

2 

0 

0 

0 

0 

3 ~ 9 MeV 

s 

1 

3 

0 

-1 

0 

0 

0 

60 ~ 170 MeV 

c 

+ 2 
' 3 

0 

0 

+1 

0 

0 

1.47 ~ 1.83 GeV 

b 

1 

3 

0 

0 

0 

-1 

0 

4.6 ~ 5.1 GeV 

t 

+- 

0 

0 

0 

0 

1 

174.3 ± 3.2 ± 4.0 GeV 


Table 1.1 Quarks (Q: electric charge, / 3 : 3rd component of isospin, S: strangeness, C: charm- 
ness, B: bottomness, T: topness. These quantum numbers change their signs for anti-quarks.) 



Q 

L e 

Lp 

L T 

mass 

e 

-1 

+ 1 

0 

0 

£ 0.511 MeV 

t*e 

0 

+ 1 

0 

0 

< 3eV 

A* 

-1 

0 

+1 

0 

“ 105.66 MeV 


0 

0 

+1 

0 

< 0.19 MeV 

r 

-1 

0 

0 

+1 

= 1777.0 MeV 

J'r 

0 

0 

0 

+1 

< 18.2 MeV 


Table 1.2 Leptons ( Q: electric charge, L e - electron number, L M : muon number, L r \ tau number. 
These quantum numbers change their signs for anti-leptons.) 


generations, for instance u, c, t have exactly the same quantum numbers. The only 
property to distinguish different generations is the difference of the masses of quarks 
and leptons, depending on the generation. The electric charges of upper and lower 
components of quark doublets are +| and — |, respectively, while those of lepton 
doublets are 0 and — 1 , respectively. At the first sight, such charge “quantization” 
seems to be miraculous. The quantization, however, is known to be well-suited to 
the anomaly-free condition for the standard model as explained in Chapter 4. 

Gauge bosons having spin 1 are mediators of interactions between quarks or 
leptons. Interaction strength depends on which gauge bosons propagate between 
quarks or leptons. Electromagnetic, weak and strong interactions are mediated by 
photons 7 , weak bosons W ± , Z° and gluons g, respectively. A photon does not 
couple to itself, while gluons and weak bosons have self-couplings, i.e. can couple to 
themselves. This is due to the fact that the electromagnetic interaction is described 
by the Abelian gauge theory, while the strong and weak interactions are by the 
non-Abelian gauge theories. Among those gauge bosons shown in (1.3), a photon 
7 and gluons g are massless and hence their interaction ranges are infinite, though 
actually the argument on the strong interaction does not go so straightly because of 
the non-Abelian nature of color interactions and quarks are confined inside hadrons 
with the range ~ 0(lO - 15 m). On the other hand, the weak bosons IT ± , Z° are 
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massive and their interactions are very short-range, such as - ~ (D( 10 - 18 m). 

The Higgs boson with spin 0 is introduced for the Higgs mechanism to work, 
which is operative in the theories with spontaneous symmetry breaking of local 
gauge symmetries as will be discussed in Chapter 3. In the Higgs mechanism a 
larger symmetry is spontaneously broken into a smaller symmetry through the vac¬ 
uum expectation value of the Higgs field and accordingly (a part of) gauge bosons 
become massive. The masses of not only gauge bosons but also all quarks and 
leptons are originated from the spontaneous symmetry breaking in the standard 
model, though neutrino masses are a little bit controversial. The real understand¬ 
ing of the mechanism of the spontaneous breakdown and the Higgs mechanism is 
still extremely challenging problem to be solved in field theories. Obviously, the dis¬ 
covery of the Higgs boson in experiment will provide a crucial hint to the problem. 
However, so far there is no evidence of the production of the predicted Higgs boson 
in high energy reactions. To uncover the Higgs boson is the most important issue 
to finally establish the standard model or even to search for the physics beyond the 
standard model, “New Physics”. This hopefully can be performed in the early time 
of this century. 


1.2 Interactions among fundamental particles 

It is well known that there are 4 characteristic interactions among fundamental 
particles: 1. Electromagnetic interaction mediated by massless photons (m 7 = 0) 
with spin=l, 2. Weak interaction mediated by massive weak bosons (mw = 80.4 
GeV/c 2 , rnz = 91.2 GeV/c 2 ) with spin=l, 3. Strong interaction mediated by 
massless gluons (m s = 0) with spin=l and 4. Gravitational interaction mediated 
by massless gravitons (mo = 0) with spin=2. (see Table 1.3.) 

Among these interactions, the gravitational interaction is usually out of game for 
particle physics because it is extremely weak compared with other interactions and 
has no meaningful effect on any reactions of those particles (,unless the energies of 
interacting particles are extremely high). For example, the ratio of the gravitational 
force to the Coulomb (electromagnetic) force between 2 protons at the distance 
10 ~ 15 m is about 10 -36 . 

The electromagnetic interaction mediated by a photon 7 has a long history of 
investigation and now it is known to be described by quantum electrodynamics 
(QED) which is the gauge theory having the Abelian U{1) symmetry. QED is 
beautifully formulated in the framework of quantum field theory and is renormaliz- 
able, i.e. various divergences originated from the loop integrals in the higher orders 
of perturbation theory can be renormalized into physical masses and wave func¬ 
tions of particles. Because of smallness of the coupling constant a = ~ the 

perturbation works well for QED. 

As will be described in Chapter 2, the theory of weak interactions for weak 
processes originally formulated by Fermi, was developed in 1950’s and excellently 
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Interaction 

coupling strength 

mediator 

spin 

1. Electromagnetic 

a — — — — 

a 4-k - 137 

photon 

1 

2. Weak 

G f = 1.16 x 10 -5 GeV -2 

weak boson 

1 

3. Strong 

— 0 1 
a s — ilT - u.i 

gluon 

1 

4. Gravitational 

G n = 6.71 x 10 -39 (GeV/c) -2 

graviton 

2 


Table 1.3 Particle interactions 

described by the current-current interaction with V - A currents. It works well 
for low energy processes. Unfortunately, the theory is not renormalizable in spite 
of its small coupling constant. This is due to the fact that the Fermi coupling Gf 
has the dimension of [mass] -2 . Thus the Fermi interaction should be regarded as 
the effective model for weak processes working only in the low energy region. In 
the dedicated study of weak interaction physics in 1960’s, many theoretical difficul¬ 
ties in the weak interaction were surmounted. A beautiful renormalizable theory 
was finally formulated, based on the unified picture of weak and electromagnetic 
interactions, in the framework of non-Abelian gauge theory with SU(2)l x C/(l)y 
symmetry (the subscript L means the fields participating in the interaction are 
left-handed and Y denotes the weak hypercharge), which is now called the elec- 
troweak standard model. One of the main themes of this textbook is to describe 
the structure and physics of this model. 

The strong interaction is mediated by gluons which have color charges. Since 

quarks have also color charges, gluons can couple to quarks. The field theory for the 

strong interaction is formulated in the non-Abelian gauge theory with SU(3) C color 

symmetry and is called quantum chromodynamics(QCD). The coupling constant of 

QCD has conspicuous behavior for a variation of momentum transfer square Q 2 , as 

2 

briefly described in Chapter 5. The strong coupling constant a s (Q 2 ) = “runs” 
as Q 2 varies. On one hand, a s (Q 2 ) becomes small for large Q 2 region as realized in 
hard scattering such as deep inelastic scattering, where quarks and gluons behave 
as free particles, implied by the word “asymptotic-free”, and in such regions the 
perturbation theory works well. On the other hand, for small Q 2 region as realized 
in the static state of bound quarks inside hadrons, a s (Q 2 ) becomes large and in this 
region the perturbative treatment is not reliable, where quarks are confined inside 
hadrons (in color singlet states). This is called the “confinement” phase. QCD must 
be the theory for describing the dynamics of quarks and gluons in all Q 2 regions 
from “asymptotic-free” to “confinement” phases. 

Weak and electromagnetic interactions are formulated by the gauge theory with 
SU{2)l x U(l)y symmetry and furthermore, the strong interactions are described 
by the gauge theory with color SU( 3) c symmetry. Hence, one can naturally expect 
that all these interactions of elementary particles must be described by the gauge 
theory with some internal symmetry G, that is, the Lagrangian has to be invariant 
under the gauge transformations of G. The simplest example is to take the symme- 
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try group G to be a direct product of each symmetry, G — SU(3) c x.SU(2)l'xU(1)y- 
The resultant theory is called the standard model of particle physics. The impor¬ 
tant principle in the formulation is that (l)the theory is the gauge theory, (2)it 
must be renormalizable and anomaly-free and (3)the symmetry breaking must oc¬ 
cur spontaneously. 

Unfortunately, the standard model has many problems to be solved and many 
theorists believe that it is not the ultimate theory. To solve these remaining prob¬ 
lems and go further, we must look for evidences for the physics beyond the standard 
model, “New Physics”. The discussion of the New Physics is another main theme 
of this textbook and will be discussed in detail from Chapter 6 to Chapter 9. The 
issues discussed in these Chapters are selected from the challenging hot topics of 
our current interest. 





Chapter 2 

WEAK INTERACTION 


The theoretical description of weak interaction processes was first worked out by 
Fermi in 1933 for the /3-decay of nuclei (Fermi, 1933). The following two decades saw 
the refinement of the explicit form of the current-current interaction first suggested 
by Fermi. The discovery of parity violation by Wu et al. in 1957 (following the 
theoretical work of Lee and Yang) led Feynman and Gell-Mann and also Marshak 
and Sudarshan to the vector minus axial vector V — A structure of the weak current 
in 1958 (Feynman and Gell-Mann, 1958; Marshak and Sudarshan, 1958). In 1961, 
Gell-Mann and Neeman independently introduced the SI/(3) symmetry of strong 
interaction (Gell-Mann and Neeman, 1964) and in 1963, Cabibbo (Cabibbo, 1963) 
used the hypothesis that the weak currents of hadrons have definite SU (3) transfor¬ 
mation properties to describe the decay of strange particles in the current-current 
coupling scheme. 

In this Chapter, after touching on the parity violation in weak interaction pro¬ 
cesses, we describe the development of weak interactions following the above histor¬ 
ical chronology. We also discuss some important subjects in the Fermi theory, i.e. 
the lepton current universality, pion decays and Cabibbo currents. Unfortunately, 
the Fermi theory is not complete. It has serious fundamental difficulies, i.e. unitar- 
ity violation and non-renormalizability. These difficulties will be discussed in the 
subsequent section. Finally, we will give some discussions on the intermediate weak 
boson model, which is an alternative model of the Fermi theory, as the preliminary 
discussion of the Standard Model. 


2.1 The Fermi Theory of Weak Interaction 

The weak interaction among elementary particles was first discovered in the /3-decay 
of nuclei. Basically it is realized as a decay of neutron n in the nucleus into a proton 
p, an electron e~ and an anti-neutrino 9, 

n p -f e~ + V. (2.1) 

7 
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The life of a free neutron due to the /3-decay is about 900sec which is extremely 
longer than that of other particles induced by strong interaction like r(p° -> 
7T+7T - ) ~ 10~ 24 sec or electromagnetic interaction like r( 7 r° —> 27 ) ~ 10~ 18 sec. 
Since the nuclear /3-decays share common nature with other weak processes, they 
are classified into weak interaction processes, though the neurton life-time is much 
larger than that of typical weak decays like r(A° -A pn~) ~ 10~ lo sec. 


2.1.1 Parity Violation in Weak Interaction 

One of the most conspicuous properties of weak interactions is parity violation. 
Before 1956 when Lee and Yang (Lee and Yang, 1956) proposed an idea of parity 
violation in particle physics, people had considered that all physical processes were 
invariant under space inversion and thus parity was a good quantum number for all 
interactions. After the experimental discovery of parity violation by Wu et al. in 
1957 (Wu et al, 1957), much progress was made for determining the weak interaction 
type which was finally established as V - A interactions. The story of this progress 
is exciting and will be discussed in the next section. But before coming into this 
history, here we would like to touch on the story of discovery of parity violation. 

It is not in the /3-decay of neutron but in the K + decay that Lee and Yang 
suggested first an idea of parity violation. K + meson decays into 7r + 7r° (Br = 
21.1%) and also even into 7T + 7r + 7r~ (Br = 5.6%), where Br denotes the branching 
ratio. Since a 7r meson is pseudoscalar, its intrinsic parity is —1. The spin of the 
K meson and 7r meson is 0 in either case. Therefore, for 7r + 7r° decays, the orbital 
angular momentum of 7r + 7r° is 0 because of total angular momentum conservation 
and thus the parity of the 7r + 7r° system becomes +1. On the other hand, for the 
7r + 7r + 7r _ system, the total orbital angular momentum of this system is given by the 
sum of the orbital angular momentum between two tt + and the one between the 
center of mass of two 7r + and the remaining 7 r~, The sum should be 0 because it 
must be equal to the spin of the K meson which is 0. This can be realized only 
when the magnitude of these two orbital angular momenta are equal. Therefore, the 
parity of this system due to the orbital angular momentum becomes +1. However, 
since the intrinsic parity of 7 r meson is —1, the intrinsic parity of the three pion 
system is (-1) 3 = -1 and hence the parity of 7r + 7r + 7r _ becomes -1, after all. Thus, 
the parity of 7r + 7r° is +1, while that of 7r + 7r + 7r~ is -1. 

Then, if parity is conserved in these decays, we must have two independent K 
mesons, say 6 + and r + ; one is of even parity and another is of odd parity. In fact, 
in those days the decays were understood as 9 + -A 7r + 7r° and r + —> 7r + 7r + 7r _ . It, 
however, seems to be too accidental that these two particles 6 + and r + have the 
same mass, the same life-time, the same spin and so on. This problem was known 
as the r — 6 puzzle in those days. To solve the puzzle, Lee and Yang suggested 
that the parity is violated in weak decay processes. If this is the case for any weak 
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Fig. 2.1 

interaction processes, the effect should appear even in the 0-decay of nucleus. They 
proposed to observe the correlation between the spin J of 60 Co and the momentum 
p e of elecron produced in the following /3-decay, 

60 Co(J p = 5+) -* 60 Ni(4 + ) +e~ +9. (2.2) 

First, align the spin J of 60 Co in a +z direction perpendicular to a fixed x — y 
plane and then set a mirror in parallel to the x — y plane at some point z (Fig. 
2.1). The direction of spin J of 60 Co in the mirror image is the same as that of the 
real 60 Co because of axialvector nature of J. Then, when an electron is produced 
in a direction, forming an angle 9 relative to J, the electron in the mirror image is 
produced in a relative angle it — 6 because of vector nature of p e as shown in Fig. 
2.1. Therefore, if the process is parity invariant, the probability for finding electrons 
produced with the angle 9 and tt — 8 must be the same. Wu and her collaborators 
observed that the probability for finding electrons produced with 6 = tt was much 
larger than the one for the case of 9 — 0. This was the first discovery of parity 
violation. 


2.1.2 Road, to Current-Current V — A Interaction 

In 1933, Fermi proposed an idea that the light leptons e~ and v are emitted from 
the neutron n in the /3-decay (Fig. 2.2(a)) like a photon 7 emitted from radioactive 
nuclei, and treated the process based on quantum field theory. In quantum electro¬ 
dynamics, the radiative process p -»• p + 7 (Fig.2.2(b)) where p is a proton in the 
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Fig. 2.2 


nucleus is described by 

e jjr ] A» = e [Up^up] A 1 *, (2.3) 

where e is the electric charge of proton and u p and are the Dirac spinor of 
proton and the photon field, respectively. The factor j t 'p Tn ' > = u p jpU p is called an 
electromagetic current of proton. To describe the /3-decay process n -A p + e~ 4- 9, 
Fermi replaced A M and the electromagnetic current jff m ^ by a 4-vector weak lepton 
current — u e T^u v made of e~ and u fields and the weak nucleon current 

_ Upj^Un of p and n fields, respectively, as 

Gji n ^ p) j(^ e ) = G [u P 7„u„] [u e ^u u \, (2.4) 

where G is the weak coupling constant being much smaller than the electromagnetic 
coupling constant e. (Actually GE 2 e 2 for moderate energies E.) That is, the /3- 

decay interaction is described by the product of two weak currents; nucleon current 
ant [ lepton current in which 4 fermions p, n, e~ and 9 couple at 

the same space-time point. There is no propagator connecting these two currents 
and this is apparently different from electromagnetic processes where, for instance, 
proton current and electron current are mediated by a photon with its propagator. 
In addition to a big difference in magnitude of G and e, it is remarkable that the 
weak nucleon and lepton currents are charged,* while the electromagnetic currents 
are neutral. 

Taking account of the parity violation in weak interaction, Fermi’s original idea 
for the /3-decay n p+ e~ +9 can be generalized by writing the weak interaction 
Hamiltonian for this process as (see, for example, Jauneau, 1977) 

Q. 

Hw = 12 Y [^Oi{l + Ci%)ijj v \ + h.c., (2.5) 

i 

where Cj can be taken to be ±1 since the parity violation is maximum and actually 

‘Long later, in 1973 the weak neutral current was also discovered with neutrino elastic scattering 
p + e~ —t P + e~ and deep inelastic scattering u + IV —> u + X and P + N —> P + X as predicted 
by the Glashow-Weinberg-Salam(GWS) theory. 
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Ci becomes —1 because the neutrino is left-handed, as discussed later in 2 . 1 . 2 .3. Gi 
is the coupling constant for the type i(= S, V, T, A, P) corresponding to the specific 
Lorentz structure of interactions and Oi is given by 

Os = 1, Scalar(S) 

Ov = In, Vector(V) 

Or = - 7*7m)> Tensor(T) ( 2 . 6 ) 

Oa = 757 M , Axialvector(A) 

Op = 75 . Pseudoscalar(P) 

Assuming the time reflection invariance in the /3-decay, we can take Gi to be real. 
Then, the remaining task is to determine experimentally which type of interaction 
among (2.6) works in the /3-decay. 


2.1.2.1 Helicity and chirality 

Let us start with the plane wave of a fermion with momentum p, energy E and 
mass m. The equation of motion of this particle is given by the Dirac equation, in 
the natural unit, h = c — 1 , 


(a ■ p +/3m)rp = Eip, (2.7) 

where a and /3 are 4x4 matrices and related to 7 -matrices as 7 0 = /3 and 7 = /3a. 
For a massless particle with m = 0 as in the case of neutrino*, we have 


a ■ pip = ptp. 


( 2 . 8 ) 


In the 7 °-diagonal representation, 75 = ^ ^ ^ ^ (see (A.25)) and by introducing 


a 4 x 4 matrix S' = 


5 0 
0 a 


, one can write a 


1 ) B 
a 0 


as a = 75 S' - < 7 ' 75, 


where 3 is the Pauli matrix. Now using 7 ! = 1, one can obtain the following 
equations from ( 2 . 8 ) 


—-Tp = 75^, (2.9) 

p p 

t Recently, due to observation of neutrino oscillation, it has been strongly suggested that neutrinos 
are massive. But in those days neutrinos were considered to be massless and here we treat 
massless neutrinos. In any case, neutrino mass has to be extremely small even if it is massive 
and it is a good approximation to consider the massless neutrinos. 
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Then, by taking addition and subtraction of two equations in (2.9) and introducing 
the helicity operator defined by h = (<? ■ p)/2p, we obtain from (2.9) 

hipR = R, hip L = ~i>L, ( 2 . 10 ) 

where ip R = P R ip with P R = (1 + 7s)/2 and ip L = P L ip with P L = (1 - 7 5 )/2 
and they are called right- and left-handed fermions, respectively. P R and Pl are 
projection operators for right- and left-handed fermions, respectively. (2.10) shows 
that ip R and tpL are eigenstates of helicity with eigenvalues h = +1 and h — — 1 , 
respectively*, h — +1 means that the particle spin is parallel to the direction of 
its momentum p and h — — 1 means that the particle spin is anti-parallel to its 
momentum p. 

75 is called chirality operator and the following eigenvalue equations hold for 
any fermions with or without masses 


75 4>R = i>R, = -tpL- 


( 2 . 11 ) 


Comparing (2.10) with (2.11) we see that helicity h is the same as chirality 75 for 
massless fermions. This is true even in the 75 -diagonal representaion. (see Appendix 
A.5.) 

Now let us consider a free electron with nonzero mass m. We are interested 
in the effect of the projection operators P R> i, on the Dirac spinor of the electron, 
which is the solution of (2.7) 


x/> = N 


X 

ap ^ 
E+m / 


( 2 . 12 ) 


where N is the normalization factor and x = 


1 


and 


0 


represent that the 


0 j “ v 1 

electron spin is parallel and anti-parallel to the z-axis, respectively. With (2.12), 
we obtain 


IpR 


X 


1 + * 


IpL 


X 


X 


(i-rfs;) 


(2.13) 


Now let the electron momentum p be along the z-axis, for simplicity. Then, we can 
calculate the expectation value of helicity h^ of electron as 


p{e) _ < ^R(L)\h {e) \lpR {L ) > = 

< V'Jl(L)IV'i?(L) > E' 


(2.14) 


Though according to the definition of helicity operator, the face value of helicity eigenvalue for 
fermions with spin ~ is h = ^ or h = the two times value of it is conventinally used and we 
follow this convention in this textbook. 
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where the sign +(-) corresponds to ipR(L)- As § equals to v (in the natural unit 
c = 1 ), with v being the velocity of electron, approaches to ±1 for relativistic 
electron with v ~ 1 . In other word, Pr(l) picks up approximately the = + 1 (— 1 ) 
state for a high energy electron(see also Appendix A.5). 


2. 1 . 2.2 Observation of electron helicity in the /3-decay 

In 1957, Frauenfelder et al. observed the longitudinal polarization of electron P^ 
in (2.14) in the /3-decay 60 Co -» 60 Ni + e~ + u, which was measured to be —v, i.e. 
chirality of electron was approximately left-handed (Frauenfelder et al, 1957). There 
are several methods to observe the longitudinal polarization of electron. Many 
experiments were performed in those days and the conclusion was that electrons 
have negative helicity (left-handed) while positrons, i.e. anti-particles of electrons, 
have positive helicity (right-handed). 

If electrons are left-handed, we can replace ip e —> 1 ~ 75 ip e and also ?/> e —> 
in the Fermi interaction (2.5). Then the lepton current factor t/i e Oj(l + Ci^ip^ 
becoms 

_ _ X 

lf e Oi( 1 + Ci7 5 )VV -» V>e-Ot(l + c »75)^ 

= 1 + CiJ 5 )lp„ 

= tPeOiilTCi)^^^, (2.15) 

where we used the relation 7^75 = — 757 ^. The sign in the right-hand side of (2.15) 
shows that — is for i ~V,A and + for i = S, T, P. This equation is very important 
in order to determine the type of O,. If neutrino helicity is measured to be —1, one 
can say that the interaction type must be V or A or a combination of them, and c* 
becomes — l(c* = +1 is not allowed). On the contrary, if it is +1, the interaction 
type should be 5, T, P or a combination of them, and Cj become +l(cj = —1 is not 
allowed). Therefore, the next step is to know neutrino helicity which is necessary 
to determine the type of weak interactions. 


2.1.2.3 Determination of neutrino helicity 

In 1958, Goldhaber et al. directly measured neutrino helicity in their exquisite 
experiment (Goldhaber et al., 1958). To measure neutrino helicity, they studied 
the A'-capture decay of 152 Eu to 152 Sm; 152 Eu first absorbs an electron in its atom, 
which is called A'-capture since an electron is absorbed from the AT-orbit, and decays 
into an excited state of 152 Sm* and a neutrino 17 and then finally 152 Sm* comes to 
its ground state by emitting a photon 7 . After all, the total reaction is as follows; 

152 Eu(J p = (T) + e- -» 152 Sm*(l _ ) + z/ -> 152 Sm(0+) +7 + 1 /. (2.16) 
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In this process, the produced neutrino cannot be detected because it is neutral and 
has no electromagnetic interaction. 

Now, let us consider a photon which runs only in the direction parallel to 
152 Sm(0 + ) in the final state. If we take the direction of running neutrino as +z-axis, 
we see that only two cases of spin alignment are allowed in this process: 

152 Eu(0~) + e~ —> 152 Sm(0 + ) + 7 + 1 / 

(i) 0-| 0-1 +i (2.17) 

(ii) 0+1 0+1 

If fil") = +1, i.e. jl'O = +1, then j| 7 ^ = -1 for the produced photon because 
of the angular momentum conservation. It means that the produced photon is 
right-circularly polarized, since it is running along —2 direction. On the contrary, 
if fil") = — 1 , i.e. jl‘' ] = — |, then the produced photon must be left-circularly 
polarized. The key point of this experiment was that they determined neutrino 
helicity by measuring the photon circular (left- or right-) polarization. The 
observed photon was left-circularly polarized, which led to = — 1. This results 
in a = — 1 as seen from (2.15) with maximum parity violation. Prom this result, 
the lepton current in (2.5) becomes a V — A type, i.e. ^7^(1 — 7 s)t/v• 

From the above observation, we can say that the allowed types of the nucleon 
current for /3-decay processes are also Oi = V and A because of Lorentz invariance 
of the interaction Hamiltonian (2.5). Historically, it has been well known that in 
the nuclear /3-decay there are two types of transitions, i.e. (l)Fermi transition and 
(2)Gamow-Teller transition, depending on the spin of the lepton pair e~v which 
can be singlet(S = 0) and triplet(5 = 1 ) states, respectively. In the nonrelativistic 
approximation which works for ordinally /3-decay processes, one can easily find 
that Fermi transitions occur for Oi — S, V, while Gamow-Teller transitions do for 
Oi = T, A. Moreover, Oi = P contributes to Fermi transitions only in the order of 
\ and thus can be neglected in the nuclear /3-decay. The process discussed here 
is the Gamow-Teller transition and hence the above result tell us that Oi = A is 
allowed but T is not allowed for the nucleon current of this process. 

2.1.2.4 Angular correlation between e + and v 

Another interesting experiment for studying the interaction type of /3-decays was 
measurement of angular correlation between e + and v in the decay 35 A —> 35 Cl + 
e + + v which was observed by Allen et al. (Hermannsfeldt et al, 1957). It is known 
that this process is Fermi transition (spin of the lepton pair e + v being 0) and only 
Oi = V or S contribute to this transition (Kistner at al, 1956). In those days before 
the experiment by Goldhaber et al. (1958) which established the neutrino helicity 
to be /(/") = — 1 , people were interested in which one of V and S is chosen by 
Nature. 

Let us take the direction of the emitted positron e + to be the positive z-axis and 
consider two possible cases; e + and v are emitted (l)in parallel with 0 = 0 , where 
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9 is an angle between momenta of e + and v, and (2)in anti-parallel with 9 = tt. 
Since the helicity of e + is +1, the angular momentum of e + is J z — +~. Then, if 
the helicity of neutrino is negative, i.e. = -1, we can expect, from (2.15) and 
the angular momentum conservation, that the case (1) is possible only for Oi = V 
but is not allowed for S and the case (2) does not occur for the Fermi transition 
with Oi = V. That is to say, it is forbidden for e + and v to be emitted in parallel if 
Oi = S. On the contrary, if h^' 1 = +1, the case (2) can occur and this time Oi = S 
is possible and V is not allowed from (2.15). Allen et al. observed that the case (1) 
was realized. 

From those many experimental observations, it was concluded that the weak 
interaction of the /3-decay is given by a combination of V and A as follows; 

= -y- [ippln^n] |/0e7 M (l - 7s)i/v] 

/7 

+ ~Y [VV7s7[V>e7 M (l - 7s)Vv] + h.c. (2.18) 

2.1.2.5 V — A interaction 

Experimentally, Gy was determined from the decay 16 O(0 + ) -> 14 N(0 + ) + e + + v. 
In this process, the spin of initial and final nucleons does not change and the spin 
sum of e + and v is 0. Thus, the process is Fermi transition and hence only Gy in 
(2.18) can contribute to this decay and then from this decay width one can obtain 
the value of Gy-, when we define Gg = Gy /\/2, we obtained 

Gg = 1.147 x 10 -5 GeV -2 ss 1 0~ 5 /m 2 p , (2.19) 

where m p is the mass of proton being the order of 1 GeV. Using this value, G a can be 
obtained, in principle, from the /3-decay of neutron, where the Fermi transition and 
the Gamow-Teller transition coexist. There was a long history of this measurement 
and now we have cja = \G a/Gv\ ~ 1.26. Unfortunately, the relative sign of Gy 
and Ga, cannot be determined from unpolarized experiments. To determine it, one 
must study the polarized experiment. 

Now, let us consider the /3-decay of neutron whose spin is polarized to the +z- 
axis and let electron and anti-neutrino momenta be given by p e and ftp, respectively. 
Since the mass difference between neutron and proton is tiny, the produced proton’s 
momentum is very small and thus, it is a good approximation to treat the nucleon 
current nonrelativistically. First consider the first term of (2.18), i.e. the Fermi 
transition term. Using the nucleon wave function which is of the same form as (2.12) 
with almost equal mass m p ss m n « m for proton and neutron, we see that Up7oUn 
is of 0(1), while t/' P 7 i/’„ is of 0{j^) in the nonrelativistic approximation. Thus, 
we can neglect the contribution of V’p'fV’n and the remaining term is only i/'p7o'0n 
which can be written as XpXn i n the nonrelativistic approximation using the Pauli 
spinor Xp(n) °f proton(neutron). It has no spin flip operator 5 and therefore, in 
this case only two patterns of the spin and momentum arrangement for particles 
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Fig. 2.3 Spin arrangement in the neutron 0-decay. Long arrows show the momenta of e and v e 


participating in this /3-decay can be allowed as shown in Fig. 2.3(a) and (b). Next, 
let us consider the second term of (2.18), i.e. the Gamow-Teller transition term. 
This time the same wave function of nucleons leads to that V’pTTsV'n is of 0(1), while 
V’p7o7sV’n is of 0(£). Thus, we can neglect tp p j Q j 5 ip n . Only Vy77sV’n contributes 
to this transition. t/’pTysV’n results in Xp^Xn in the nonrelativistic approximation. 
For <73 term in Xpd'Xn, the spin and momentum arrangement of particles are same 
as shown in Fig. 2.3(a) and (b). However, <ti and <72 terms contribute to spin flip 
and thus we have the spin and momentum arrangement as shown in Fig. 2.3(c). 

After all, in the nonrelativistic approximation we have the following interaction 
Hamiltonian for the /3-decay of neutron. 

H w = G v [XpXn] [vivA 

+ G a [XpO'Xn] [vt°riv\ + h.c., (2.20) 

where in (2.18) is replaced by t] v being the two-component neutrino wave 

function with helicity ~ — 1 and r] e is also the two-component Weyl spinor 
of electron with helicity h^ = — 1 (see (A.53) for the notation of two-component 
Weyl spinors). Now, let us calculate the matrix element for the process presented 
in Fig. 2.3(c) where the neutron spin flips. The spin flip is originated from the 
following terms in (2.20), 

[Xp^iXn] [vt<riVv\ + [XpC^Xn] [vU2T)»\ 

= 2 [Xptf+Xn] [nt<r-Vv\ + 2 [XpO'-Xn] [nU+riu] , (2.21) 

where <7± = (<7i ± *cr 2 )/2 and actually only the 2nd term of (2.21) can contributes 
to the spin arrangement presented in Fig. 2.3(c). We can parametrize this matrix 
element as 

M (c) = 2 G a F, (2.22) 





The Fermi Theory of Weak Interaction 


17 


where F is a factor containing all dynamics of this decay process except for the 
coupling constant Ga- 

Next, let us consider the case of Fig. 2.3(a). This time, the 1st term of (2.20) 
and the 173 term in the 2nd term of (2.20) contribute. In Fig. 2.3(a), the electron 
spin being in parallel to the neutron spin is along the +z-axis. Hence 03 works as 
+1 in the matrix element and we can write the matrix element for this process as 

M (a) = (G v + G a ) F. (2.23) 

For the case of Fig. 2.3(b), since the electron spin is along the —z-axis, we obtain 
the following matrix element, 


M {b) = {G v - G a ) F. (2.24) 

Then, we can calculate the probability for an electron to be emitted in parallel to 
the neutron spin as shown in Fig. 2.3(b), as 

PtftPe t) = \M {b) \ 2 = \G v -Ga\ 2 \F\\ (2.25) 

To the contrary, the probability for an electron being emitted in anti-parallel to the 
neutron spin, which is given by the sum of Fig. 2.3(a) and (c), can be calculated as 

P{8 1 p e ;) = + |M< C >| 2 = |GV + G a | 2 |F| 2 +4|G a | 2 |F| 2 . (2.26) 

If Gy « Ga, then P{8 t Pe t) ~ 0, which means that an electron cannot be emitted 
in parallel to the neutron spin. If Gv ~ -Ga, then P{a t Pe t) w P(<? t Pe I), 
which means that the number of electrons being emitted in parallel and anti-parallel 
to the neutron spin is almost same. The latter case was observed in experiment 
and then it was finally established that the weak interaction Hamiltonian for the 
/3-decay was described by the V — A current-current interactions as summarized as 
follow; 

H w = Q= [V' p 7m( 1 - flU75)V>n] [i/’e7 M ( 1 - 75)^] + h.c ., (2.27) 

where the Fermi constant Gp is given by (2.19) and <?a = -Ga/Gv — 1-26. 


2.1.3 Lepton Current Universality 

The discovery of muon neutrino as a new lepton distinct from an electron neutrino 
led to the concept of p — e universality. A muon decays into an electron as fi~ —> 
e~ + i> e + 17 , with its life-time r M = 2.2 x 10“ 6 sec. At first sight, pT could decay 
electromagnetically into e~ by emitting a photon as p~ —> e~ + 7 , because this 
process is kinematically allowed. However, there has been no observation for this 
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process so far.§ Existence of the particular decay mode pT -+ e~ + v e + z/ M and 
non-existence of the decay mode pT —i e _ + 7 led that in Nature there are additive 
conserved lepton numbers: the electron number L e (L e = -f 1 for e~ , v e and L e = — 1 
for e + , D e ) and the muon number (L^ = +1 for p~, i/ M and = -1 for p + , 
P M ). e~ interacts with n e alone and rT interacts with alone. Later, the third 
lepton members, the tau r and its neutrino v T (tau neutrino) were also dicsovered 
with additive conserved tau number L r (L T = +1 for t~ , v T and L T = -1 for t + , 
u T ). Now we have three families of leptons as 



The universality of weak interactions is expressed by writing the total weak current 
of leptons as a sum of an electron, a muon and a tau current with equal weight as, 

jjP = 4 e) + 

= i'e7M( 1 ~75)e + iV7M( 1 -75)/i + i'T7M( 1 -75)'r, (2.29) 

which can be rewritten as 


Ji e) = K Vrhn(l ~ lh)V 



(2.30) 


where the wave function of each particle is written by the symbol representing the 
particle itself and V is a 3 x 3 diagonal matrix given as follows; 


1 0 0 \ 

0 10. (2.31) 

0 0 1 / 

The explicit form of V shows the lepton universality. 

Now let us concentrate on the muon decay -» e~ {p e ) + v e {k Ve ) + v, 

where momentum of each particle is given in parenthesis. The decay amplitude is 
given by 

M= ~^ ^ kv ^ lp ^ 1 ~ 75 )u(p»)] [u(p e )Y(l - 7sMAvJ] , (2-32) 

where u and v denote the spinors for particles and anti-particles, respectively. The 
weak coupling constant can be determined from measured value of the life-time 
of muon. Since the muon decay involves no complication due to hadronic currents, 
we can accurately determine the value of G fl . 



§Only upper bound of the branching ratio for this decay mode, being extremely small, is known: 
Br(fi —> e + 7 ) < 1.2 x 10 -11 . 
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The decay rate (see (B.15) is given by 

1 


dF = 


2 E u 


\M\ 2 dR , 


(2.33) 


where dR is the Lorentz invariant phase space (see (B.14) given by 


dR = 


d 3 p e 


d 3 k, 




d 3 k v 


(2tt) 3 2 E e (2tt) 3 2 u v , (2n) 3 2u v 
1 d 3 p e d 3 k Vr 
(27r) 5 2 E e 2wv c 


(2 t r) 4 <5 ( 4 ) (p M - p e - k Vll - k Ve ) 

6(E, -E e - oJvMiPn ~ Pe - *0 2 )- (2.34) 


In (2.34), 9 is the step function, 9(x) = 1 for x > 0 and = 0 for x < 0. \M\ 2 
denotes the probability of spin-averaged for initial particles and spin-summed for 
final particles, 


imp = 

spin 
q2 _ 

= 2 N fe "j7p(l - 76)«(Pm)«0 j m)7o-(1 ~ 7s)«(*^)] 

spin 

X [^(Pe)7 P (l - - 75)u(p e )] 

spin 

Q2 

= -^■Tr[^7p(l-7s)(^ -m M )7 D .(l-7 5 )] 

x Tr|>e7 p (l - 75)^7 <T (1 ~ 75)] 

= 64G 2 (Av e -p M )(A^.p e ), (2.35) 


where the electron mass was neglected because it is very small as m e < m M /200. In 
the rest frame of muon p = (m M , 0 , 0 , 0 ), we have (fc„ e -p^)(k Vtt -p e ) = |w„ e 7 n M (m 2 — 
2 m M ai e ) and thus we can write the decay rate as 


dr 


Gl d z p e d 3 k 
2 m ) 1 7 r 5 2£/ e 2w y<! 


(m 2 


2 


x <5(m 2 - 2rrifj.Ee - 2 m ) 1 w„ ( + 2 E e oj Vc (1 - cos#)), 


(2.36) 


where 0 is an angle between produced electron and anti-neutrino. Using the relation, 
d 3 p e d 3 k v< . = 4nE 2 dE e ■27rul^dui l/t dcos8, and integrating over cos 9 using 6 function, 
we obtain 


G 2 

dr = dE e du) Vc u Ve (m 2 - 2m M u„J. (2.37) 

From the condition — 1 < cos 8 < 1, the energy and E e are constarined as 
rrifj/2 — E e < < m M /2 and 0 < E e < m 4 /2, respectively. Then, the energy 
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spectrum of electron is obtained by integrating oj Vc over this region 

(2.38) 


(2.39) 

where the maximum of electron energy is E™ ax = m^/2. Using = 2.2 x 10 _6 sec 
and = 105 MeV, we get 

Gn = 1.166 x 10 -5 GeV -2 « l(T B /mJ. (2.40) 


dr 

dE e 


GlK E l 

47T 3 


l_4£e 
3 m u 


The total muon decay rate is obtained by integrating by E e 

rpmax 


r i oj< 

Tn 47T 3 




Glm% 
192tt 3 ’ 


Comparing (2.40) with (2.19), we see that G M for muon decays is almost equal to Gp 
for nuclear /3-decays and thus the weak coupling constant looks universal. Therefore, 
the origin of the weak interaction seems to be same. In fact, by taking account of 
radiative corrections (Roos and Sirlin 1971; Beg and Sirlin, 1982), we see that the 
values of G M agrees with the one of Gp within only about 2%. However, it is very 
important to notice this small difference between couplings in lepton currents and 
hadron currents, which led Cabibbo to the splended idea of the so-called Cabbibo 
mixing currents discussed later. 

Before coming to the discussion of Cabibbo currents, we just consider the decay 
of the third lepton r. The decay width of r is similarly calculated as in the muon 
case discussed above. This can be done by just replacing and G M by m r and 
G r , respectively. Then, using the relation 


T r 



Br(r —> e v e v T ) 
Br(y~ -> e~v e v fl ) ’ 


(2.41) 


and the experimental data of m T = 1776.99 MeV, = 105.658357MeV, = 

2.19703 x 10 _6 sec, r r = 2.906 x 10 _13 sec, Br(r~ -A e~P e u T ) = 0.1784 and Br(/i“ —> 
e~v e Vfj) = 100%, we can obtain G r /G M = 1.001. The result confirms again the 
universality of the weak interaction. 

In summary, the near equality of coupling constant involved in the weak leptonic 
decay of r, t leptons and in the /3-decay, i.e. G M « G r « Gp shows that there is 
a universal weak interaction constant G in all weak processes. In other words, the 
value of weak charge is universal and therefore, it is customary to represent this 
universal coupling constant G by Gp called the Fermi constant. 


2.1.4 Pion decays 

Can we understand the meson decays, too, by the V - A interaction? the answer 
is YES. We can explain them also by the V — A interaction. Let us consider the 
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pion decay as an example. A pion can decay into a muon(Sr m 100%) or an 
electron(Br = 1.230 x 10 -4 ) as 


7T -» ft +Vp, 

—^ 6 *f V t . 


(2.42) 

(2.43) 


For these decays, there are two remarkable experimental findings; (l)the observed 
p~ in the process (2.42) was right-handed. This observation results in the conclusion 
that the is also right-handed, since in the rest frame of 7 r“ being spinless, p~ 
and run into opposite direction with equal momentum and because of angular 
momentum conservation, two decay particles must be right-handed. This result is 
cosisitent with the (1 — 75 ) factor in the lepton current. (2)The observed ratio of 
branching ratios of (2.43) to (2.42) is extremely small, i.e. 


r(7T —> e + v e ) 

r(7r- -»■ ft- + j/ M ) 


1.23 x 1(T 4 . 


(2.44) 


This is completely contradictory to the prediction of the phase space effect. The 
origin of the strong suppression must be dynamical. Actually, this is due to the 
V — A structure of the lepton current. In fact, since v e is right-handed, the emitted 
electron must be also right-handed because of angular momentum conservation. But 
this is the “wrong” helicity state for light electron, because a fast moving electron 
should be left-handed as described in 2.1.2. This is not the case for a muon; a muon 
emitted in the pion decay is rather heavy and has the right-handed component, i.e. 
the “right” helicity state. 

Let us go to a quantitative discussion. Assuming the V — A interaction, the 
matrix element for these decays Tt~(q) — > (~{p) + Pt(k) where l = p or e and 
momenta of are given in parentheses, is written by 

M = ^ < 0|J; |t r~(q) > u t (p) 7^(1 - 75 H, (fc), (2.45) 


where < 0 | \F~{q) > contains all strong interaction effects, being called the pion 

decay constant f n , expressing the effect of the pion decay into vacuum, and must 
be vector or axialvector so that this factor can contract with the lepton current to 
make M be Lorentz scalar. Since a n~ is spinless, available 4-vector is q alone. 
Then we can write 


< 0| J* |tt {q) >= q^fn- (2.46) 

Substituting (2.46) into (2.45) and using q = p + k, we can write the amplitude as 

G 


M = + ~ ^) v v t (k)] , 


G_ 

y/2 


miUu t (p)( 1 - 7 s)v„ t (k), 


(2.47) 
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where in going from the 1st line to the 2nd line, we used the Dirac equation for a 
lepton l with mass and the one for a neutrino vt with zero mass. Then, using 
the standard technique taken in the calculation of the muon decay width described 
in the previous section, we obtain the result 


r(7r ->i + *<) = (i - • 

Then, we have the ratio 

rf*- ^ e ~ + p ‘\ = (^)‘( ^ 4) 2 * i.28 x 10 

r(7T- -y H~ + I/ M ) 


(2.48) 


(2.49) 


which fantasitcally agrees with the experimental result of (2.44). Furthermore, from 
the observed life-time of tt - , t = 2.6033 x 10 _8 sec, we can estimate the value of the 
decay constant f n as 


fir - 0.92 m n 


(2.50) 


which is an order of the pion mass. 

Above observation suggests us that the V — A interaction works well even for 
meson decays. 


2.1.5 Cabibbo Currents 

The universal Fermi interaction of the V —A current-current form is quite successful 
in describing observed leptonic weak decays. However, the situation is expected to 
be less clear in the treatment of semileptonic and hadronic weak decays because 
of hadronic strong interaction effects. There are many such weak decay transitions 
of hadrons including strangeness. Experiments show that the strangeness non¬ 
conserving (AS ^ 0) weak decays are relatively suppressed as compared to the 
strangeness conserving (AS = 0) weak decays. 

A modification of the weak current that explains the observed suppression of 
AS = 1 transitions compared to AS = 0 transitions, was proposed by Cabibbo 
in 1963 (Cabibbo, 1963). Instead of introducing new couplings to accommodate 
strange particle decays, he tried to keep universality by modifying the hadronic 
current. He assumed that the total weak current of hadrons flows into the AS = 0 
and AS = 1 branches, keeping the total weak current of hadrons 

=aj(°)+6jW, (2.51) 

where J^ and are currents corresponding to AS = 0 and AS — 1 transitions, 
respectively. J^ and are normalized so that the strength of the corresponding 
transitions are given by the coefficients a and b. Assuming that the total weak 


‘ T*lt-y.A.LC.A. 



The Fermi Theory of Weak Interaction 


23 


hadronic current remains unchanged, we have 


u | 2 + | 6| 2 = 1. 


(2.52) 


It is customary to write a — cos 0 C and b = sin 6 C , where 6 C is called Cabibbo angle. 
The normalization condition (2.52) is then automatically satisfied and the weak 
current of hadrons is written by a mixing of the strangeness conserving (A5 = 0) 
current and the strangeness changing (AS = 1) current 

= cos 9 C + S in 9 C JW. (2.53) 


To find the value of the Cabibbo angle 9 C , we compare the decay width of n~ 

and K~ -> The decay width for n~ p~ + is given by (2.48) 

in which G is replaced by cos0 c C? by taking account of the Cabibbo mixing effect 
presented in (2.53). The decay width for K~ ->■ p~ 4- is also given by replacing 
m„, /jt and G by rriK, Jk and sin 9 C G, respectively. Then, we have 


Y(K —> ft + Vfjf) 
r(7r- + i/ M ) 


sin 9* c f 2 K mKl 

(■-Sr) 

cos9 c ft m n \ 

(’-si) 


(2.54) 


Assuming the 5f/(3) symmetry for meson decay constants of Jk = fn and using 
the experimental result(« 1.335) for the lhs of (2.54) and the values of m M , m„, 
m,K presented in the particle data table (Particle Data Group, 2002), we obtain 


tand c ~ 0.275, 


(2.55) 


which leads to sin 9 C w 0.26. More detailed analyses were carried out with the 
results, sind c = 0.220 ± 0.002 from K~ — > tt° + e~ + v e (Braun et al., 1975) and 
sin0 c = 0.231 ± 0.003 from A p + e~ + v e (Bourquin et al., 1983). Because of 
the small values of 9 C , those decays whose amplitudes are proportional to cos 0 (: , 
are known as Cabibbo-favored decays, while those with amplitude proportional to 
sin 8 C are Cabibbo-suppressed decays. 

Let us now understand Cabibbo’s proposal at the quark level. The /3 decay 
process n —> pe~i> e at a quark level is d —> ue~v e (Fig.2-4 (a)): one of the d quarks 
in the neutron (ddu) transforms into a u quark with remaining d and u quarks as 
spectators. In contrast, in the decay process A 0 pe~i> e in which A 0 has quark 
content uds, the strange quark in A 0 transforms into a u quark (Fig.2-4 (b)). Again 
this involves charge changing weak currents but in this case there is also a change of 
strangeness at the baryon vertex and hadronic current is therefore called strangeness 
changing or AS = 1 weak current. The quark current has the same V — A structure 

= <f 7 M ( 1 - 75 )q- (2.56) 

Furthermore, we retain the universality for all particles such as quarks as well as 
leptons and use the same coupling constant for AS = 1 and AS = 0 decays, except 
for the Cabibbo angle. In weak interactions with charged curents, leptons can only 
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AS = 1 
(b) 


Fig. 2.4 


be transformed into their partners in the same doublet; e.g. e _ v e for 
and fj.~ i'n for y Vfl j. Similarly, we can group the quarks into families like 

But for quark transitions, they are observed not only within a 

family but, to a lesser degree, from one family to another via Cabibbo mixing. 
Therefore, for charged currents of quarks, the “partner” of the flavor eigenstate u 
is not just the flavor eigenstate d but a linear combination of d and s, which is 
called d'. Similarly, the “partner” of the c quark is another linear combination of 
d and s which is orthogonal to d' and is called s'. The coefficients of these linear 
combinations can be written by using a single mixing parameter 6 C as 




d' = cos 0 C d + sin 6 C s, 
s' = — sin# c d + cosQ c s. 


(2.57) can be written in the matrix form as 


d 1 

s' 



with U = 


cos 9 C sin 6 C 
— sin0 c cos0 c 


(2.57) 


(2.58) 
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U is unitary and performs the rotation of quark states, 



and 



into 


“rotated” states, 


d' 


and 


. The transitions s u as compared todfHi 


are therefore suppressed by a factor of sin 2 6 C : cos 2 6 C « 1 : 20, in accordance with 
the data. 

If we add the third generation of quarks f * V the 2x2 matrix of (2.58) is re¬ 


placed by a 3 x 3 matrix which was originally introduced by Kobayashi and Maskawa 
in 1973 (Kobayashi and Maskawa, 1973) and is called the Cabibbo-Kobayashi- 
Maskawa matrix (CKM matrix), 


d' \ 

l / U u d 

Uus 

Uub 

s' 

= Ucd 

U ca 

u cb 

b 1 ) 

' { Utd 

Uts 

Utb 



(2.59) 


The probability for a transitions from a quark q to another quark q' is proportional 
to \U qq '\ 2 , the square of the matrix element. The diagonal elements of this matrix 
describe transitions within a family; they deviate from unity by only a few percent. 
It was found from the analysis of various meson decay data that the values of matrix 
elements U ua and U c d were nearly one order of magnitude smaller than those of U u d 
and U cs . Furthermore, transitions from the third to the second generation (t -> s, 
b -> c) are further suppressed compared to transitions from the second to the first 
generation. The average values of \U U d\, \U us |, \U C d\ and \U CS \ obtained from the 
analysis of experimental data (Particle Data Group, 2002) are 


\U ud \ = 0.9735 ± 0.00108, \U ua \ = 0.2196 ± 0.0023, 

\U cd \ = 0.224 ±0.016, \U ca \ = 1.04 ±0.16. (2.60) 


The observation that strangeness non-conserving weak interactions are relatively 
suppressed as compared to the strangeness conserving weak interactions led to fur¬ 
ther extension of the concept of universality to involve weak hadronic currents. 
After all, the Cabibbo theory established the quark-letpon universality and sloved 
the puzzle on the slight discrepancy between the value of Gp (in (2.19)) determined 
from the nuclear /3-decay and the one of (in (2.40)) from the ji-decay; in fact, 
Gp = G^ cos 0 C with cos 6 C as 0.975. 


2.1.6 Difficulties in the Fermi Theory 

As we have seen in the previous sections, the Fermi theory of weak interactions, 
whose definitive form was established by Feynman and Gell-Mann as the current- 
current form with V — A currents and which was extended by Cabibbo to processes 
including hadrons such as AS = 0 and A S = 1 decays, works very well in describing 
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Fig. 2.5 


phenomena of the observed charged current weak interactions. The Fermi theory of 
its simple and elegant form could not only explain many experimental data of decays 
but also matches well with the physical principle such as universality, algebraic 
properties embodied in Cabibbo currents, and so on. 

However, it is now also well known that the Fermi theory cannot be a complete 
theory, even if it works well as phenomenology. When we apply it to scattering 
processes, the lowest order approximation violates the unitarity bound. Further¬ 
more, unlike the quantum electrodynamics for electromagnetic interactions, the 
Fermi interaction is not a renormalizable theory and thus we cannot manage those 
higher-order contributions to yield a finite outcome. 


2.1.7 Unitarity violation 

To see how the Fermi interaction violates the unitarity bound, let us consider the 
neutrino-electron scattering (see Fig. 2.5) 

i 'e(fc) + e - (p) e~(p') + F e (fc'), (2.61) 


whose amplitude is given by 

M = ^ - 7fc)«e(p)] [«e(p / )y*(l - 75)«„.(*)], (2.62) 

from which, by neglecting the electron mass m e and following the standard technique 
for calculating cross sections, we can obtain the differential cross section as 


do_ _ |M[ 2 _ G% 
dCl 647r 2 s 47 t 2 S 


(2.63) 


where s = ( k+p) 2 = AE 2 = 4 k 2 ( E(k ) is the CMS energy (momentum) of the v e +e~ 
system). Integrating over the solid angle 12, we obtain the total cross section for 
this process 

<j = -E-s. (2-64) 
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Fig. 2.6 


On the other hand, as is well known, every scattering cross section can be decom¬ 
posed into partial waves. For spinless particles, we have in general 


da 

dCt 


= um 2 = 


2 ik 


J2(2e + l)f e P e (cos6) 


t=o 


(2.65) 


where ft is the partial wave amplitude for an orbital angular momentum t. For 
the pointlike Fermi interaction, only f 0 contributes to v e - e~ scattering (2.61) and 
there is no angular dependence (with P 0 (cos 6) = 1). Then, we expect to have 

dCl = ' ^ 2 ' 66 ^ 

However, since unitarity requires |/f| < 1 for every partial wave, we obtain the 
upper bound for the diffrential cros section as 


da_ _ 1 _ 
dU - 4 E 2 ‘ 


(2.67) 


Then, the total cross section is also bounded similarly as 


a = 


/ 


da 

dfl 


dSl = 47T 


da it 
dtt~EP' 


( 2 . 68 ) 


Therefore, the prediction of (2.64), being in the lowest order of the Fermi interaction, 
violates unitarity bound at some high energy 


£ ^/l;’“ 370GeV - <2 - 69) 

One might suppose that this is due to neglection of higher-order corrections of 
Fermi interactions. But this is not the case. For example, if we take into account 
the 2nd order diagram shown in Fig. 2.6, we will find that the calculation leads to 
a divergent result. We cannot cure this disease from those higher order corrections. 
Rather, we can see that the root lies in the Fermi interaction itself. 
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Fig. 2.7 


2.1.8 Non-renormalizability 

Another serious difficulty in the Fermi theory is that it is not a renormalizable 
theory, which is very different from the case of quantum electrodynamics(QED), 
being renormalizable. As is well known, we have divergent integrals originating 
from higher-order diagrams even in QED as well. However, in QED we can remove 
those divergences by renormalizing the mass and charge into the observed physical 
values. On the other hand, in the case of the Fermi interaction, if we try to follow 
the renormalization procedure analogously as in QED and go to the next higher 
order diagram, we will encounter a more serious divergence, which needs further 
renormalization constants. Then, when we consider all higher order diagrams, we 
need an infinite set of renormalization constants. That is, the theory cannot be 
renormalized, (see, for example, Aitchison and Hey, 1989) 

By taking the reaction v e + e~ v e + e~, let us see what is going on. The 
lowest order diagram for this reaction is given in Fig. 2.5, which contributes to 
the amplitude with the order of G F . The 2nd order correction comes from Fig. 
2.6. There we have two fermion propagators, each one behaves as 1/A: for large 
internal fermion momentum k, and thus due to these internal fermion propagators 
we have a loop integral contribution to the amplitude with G 2 F f jjr, which is di¬ 
vergent. Similarly, the 3rd order diagram (Fig. 2.7) contributes to the amplitude 
with G F (f jjr) 2 , which is more seriously divergent. After all, we have a series of 
expansion with Gf f , where the higher order terms yield more and more severe 
divergence. This is because Gf has a dimension of [mass] -2 ; each term has a new 
Gf in the expansion series, which must be compensated by a divergent factor f 
with [mass] 2 dimension in each term to keep the dimension of the amplidude. 


2.2 Intermediate Weak Boson Model 

One of the interesting ideas to attempt to take away those difficulties is the in¬ 
termediate weak boson exchange model. The weak coupling constant G{= G F ) 
appearing, for instance, in (2.27), (2.32) and (2.62) has a dimension of [mass] -2 
(see (2.40)). This fact suggest us that we can develop further an analogy between 
the weak interaction and the electromagnetic interaction by postulating that the 
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weak interaction is mediated by a massive weak boson, just as the electromagnetic 
interaction is mediated by a massless photon. For example, assuming the W boson 
exchange between two charged lepton currents (Fig. 2.8), let us write the decay 
amplitude for gT —>• e~ + v e + as 


M — - 


9 - 1 - 7s. 

-<r + y- 

1V1 w 

9 - 1-75, 

2 

cf-M 2 , 

lX /2 e7ff 2 


(2.70) 


instead of (2.32), where g is the dimensionless coupling constant and q represents 
the momentum of an intermediate W boson with mass Mw- This form is quite 
analogous to the electron-proton scattering amplitude mediated by a massless pho¬ 
ton 

M — - [eu P 7 M Up] ~ [-eUej^Ue]. (2-71) 

We see that a photon propagator between two electromagnetic currents with charge 
e is replaced by a IF-boson propagator between two charged currents with coupling 
constant -4=. Since the Fermi interaction works as a contact interaction at zero 
range, the W boson must be very heavy, unlike a massless photon. Furthermore, 
at low energy where q 2 is much smaller than M(y, the Fermi interaction works very 
well and thus (2.70) should reduce to (2.32) with 


G f __ g 2 
V2 8M^’ 


(2.72) 


From (2.72), the fact that the weak interaction is weak with small Gf is considered 
to be not because the coupling g is small, but because Mw is large with g « e. This 
suggests that the electromagnetic and weak interaction might be formulated into a 
unified theory. In fact, later this idea was beautifully realized in the non-Abelian 
Gauge theory called Glashow-Weinberg-Salam(GWS) theory, which is the one of 
main subjects of this textbook and will be discussed in detail in the next Chapters. 

But before coming into the GWS theory right now, it is instructive to learn a few 
things about the intermediate weak boson model. One is related to the problem 
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Fig. 2.9 


of unitarity bound. As discussed above, the Fermi interaction, i.e. the contact 
interaction leads to violation of unitarity bound. Then, can the intermediate weak 
boson model cure this disease? The answer is NO. For example, let us consider the 
process, in the lowest order, (Fig. 2.9) 


v»(p) +MP') -> W + (k,X) + W-(k',X'), (2.73) 


which is predicted to exist in the intermediate weak boson model. In (2.73), the 
momentum of each particle is given in parentheses, and A and A' denote the polar¬ 
ization of W + and W~, respectively. The invariant amplitude for this process is 
given by 


M (A,Y) = {^fef\ k ')e< x \k)v{p')Y 


1~75 

2 


U>~$ + m n) 

(p - k) 2 - m 2 


1~75 

2 


u(p), 

(2.74) 


where s^^k) and c^ A ) (k 1 ) are the polarization vector of W + and W~, respectively. 
From this amplitude, one can calculate the differential cross section at high energy 
as 


da 

dfl 


&F - 2 op2 
8^ Sin 6E ’ 


(2.75) 


where E and 9 are the energy and the scattering angle of the incoming v^ in the 
CMS, respectively. Integration over the solid angle results in the total cross section 


a = 



(2.76) 


where s = AE 2 . The cross section increases with s and hence violates the unitarity 
bound at some high energy. Therefore, the disease of unitarity violation cannot be 
cured even in the intermediate weak boson model. 

Another difficulty, i.e. the non-renormalizability of the Fermi interaction also 
cannot be solved by the intermediate weak boson model. To see this apparently, let 
us consider an example of scattering presented in Fig. 2.10. In this figure, (a) is the 
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V e V e 



(b) 


Fig. 2.10 


two photon exchange correction to the QED process, e + + e _ —> e + + e~, while (b) 
is the two W boson exchange correction to the weak process, u e + e~ —» u e + e~. 
For the photon exchange diagram (a), since the photon propagator behaves as 1/fc 2 
for large internal momentum k, we have a loop integral f for this amplitude, 
which is finite. On the other hand, for the W boson exchange diagram (b), since 
the W boson is massive and its propagator has a longitudinal component 


-1 


fc 2 -M&, 


/ knky 

{9n» ~ 


m 


w 


(2.77) 


in which k dependence vanishes at large k, the corresponding loop integral becomes 
f \£ > which is quadratically divergent and the situation does not change from the 
case of the Fermi interaction, i.e. the contact interaction. After all, the intermediate 
weak boson model cannot be renormalized to get a finite result. 

After a long struggle, an extremely elegant and consistent theory without theo¬ 
retical dificulties of unitarity violation and non-renormalizability was discovered for 
the lepton family by Weinberg and Salam in 1967 (Weinberg, 1967; Salam, 1968). 
The theory was extented to quark families successfully and, combined with quantum 
chromodynamics(QCD) which is the fundamental field theory of strong interactions, 
was developed as the standard model of the particle physics. But before going into 
the Glashow-Weiberg-Salam(GWS) theory, we need some preparations, which will 
be discussed in the next Chapter. 


Problems 

2.1 Show that the coupling constant Gi in (2.5) must be real if the interaction 
Hamiltoniam (2.5) for the /3-decay is time-reflection invariant. 
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2.2 Using (2.13), prove (2.14). 

2.3 In the nonrelativistic approximation, one can see that the Fermi transition for 
/3-decay processes occurs for Oi = V and A, and the Gamow-Teller transition does 
for Oi = A and T, while Oi — P can contribute to the Fermi transition only in the 
order of ^ and thus can be neglected. Show these results using the nonrelativistic 
expression for the Dirac spinor of nucleons. 

2.4 Starting with (2.45), derive (2.48). 

2.5 Stating with (2.62), drive (2.63). 

2.6 Starting with (2.74), drive (2.75). 





Chapter 3 


SYMMETRIES AND THE GAUGE 
THEORIES 


One of the most fundamental principles in particle physics is that interactions among 
fundamental particles are described by symmetry principles. The invariance of the 
Lagrangian under certain symmetry transformations leads to a set of conservation 
laws. In addition to space-time symmetries such as Lorentz invariance, parity invari¬ 
ance, time reversal invariance etc., the internal symmetries such as isospin, flavor, 
color etc. have been also known. The development of our understanding about 
the fundamental interactions is based, to a large extent, on our understanding of 
such underlying symmetries of Nature. It is remarkable to know that there is a 
connection between exact symmetries and conservation laws and to see that the 
requirement of local gauge invariance can serve as a dynamical principle to guide 
the construction of interacting field theories. 

In this Chapter, we first discuss the relation of the global symmetry to the 
Noether’s theorem, which connects the invariance of the Lagrangian under a con¬ 
tinuous symmetry transformation to a conservation law and the conserved quantum 
numbers. Then the local gauge theory will be developed with an example of quan¬ 
tum electrodymanics(QED), being an Abelian gauge theory with U( 1) symmetry. 
A non-Abelian Yang-Mills gauge theory is also discussed with an example of SU (2) 
symmetry. Furthermore, an idea and an important result of the spontaneous sym¬ 
metry breaking(SSB) and the Goldstone theorem will be discussed. Finally, we will 
discuss the Higgs mechanism, which is realized in the mechanism of SSB with local 
gauge symmetries. 


3.1 Global Symmetries and Noether’s Theorem 

In the field theory, fields are used to describe the fundamental particles. Dynamics 
among those particles are given by the Lagrangian density* C(<j)(x), d^4>(x)) which 
depends on the field <f>(x) and its derivative d tl (p(x). Fields are arranged so as to 
realize various symmetries mentioned above. The Lagrangian formalism provides 

‘The Lagrangian density is often called, simply, Lagrangian. We follow the same usage. 
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a systematic way of identifying these symmetries and extracting the constants of 
motion associated with these symmetries. 

Let us consider a system composed of a set of n independent fields* 0 a (x) (a = 
1,2, and let the Lagrangian of this system be invariant under the trans¬ 

formation of a certain group G. The equation of motion for </> a (x), called the 
Euler-Lagrange equation, 

Wa ~ ^ d(d^ a ) =0 ’ ( a=1 - 2 ’---> n )’ ( 3 - 1 ) 

is generated from the requirement of least action, 

SS = 6 J d 4 z£( 4 > 0 (x), dMx)) = 0 . (3.2) 

We define the “momentum” fields which are canonically conjugate to field variables 
<p a (x) as 


7Ta(z) = 


dc 

d(do<pa) ’ 


(3.3) 


and assume finally the canonical commutation relations to quantize the system 


[cf> a (x,t),<t> b {y,t)\ = [n a (x,t),n(y,t)] = 0, (3.4) 


[4>a(x,t),nb(y,t)] = iSabS 3 (x - y). (3.5) 

Now, we are interested in the variation of C under the transformation <p a -t (f> a +S(j) a . 
The result turns out 



by using the Euler-Lagrange equation (3.1). When the transformation parameters 
are independent of the space-time point x , it is called the “global” transformation. 
As an example, we introduce a transformation under a special unitary group G = 
SU(n) as 

Ha = -i&TUM*) Ha -> <P' a = (3-7) 

using x-independent infinitesimal parameters 9 l and an n x n matrices T l , where 
a,b = 1,2, • ■ ■, n and i = 1,2, • • •, n 2 — 1. Here the summation over over i and also 

tFor simplicity, we consider the case of boson fields for the moment. For the case of fermion fields, 
we need no change in the following discussion if commutation relations (3.4) and (3.5) below are 
replaced by anti-commutation relations. 
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over b is implied. T l is called generators of the SU(n) group. Then, when we define 
the current 

(3 - 8 > 

(3.6) can be written as 

SC = 6 i d ll j^(x). (3.9) 

Thus, if the transformation (j) a —► <S> a + 54> a leaves C invariant, i.e. SC = 0, we have 
the current conservation law, 

d^{x) = 0. (3.10) 


By integrating this equation over all space, 

Jd 3 x(J- t j i0 + V-A =0, (3.11) 

and by assuming for j'(x) to vanish at spatial infinity, we are led to, by using the 
Gauss’s theorem, the charge conservation law, 



(3.12) 


where the “charge” is defined by 


Q\t) = J d 3 xj l0 (x,t). 


(3.13) 


One can see that a symmetry of the Lagrangian under a field transformation implies 
conservation laws; this is called the Noether’s theorem (Noether, 1918) and the 
current j lfJ '(x) is called Noether currents. 

In quantum theory, we have the Heisenberg equation 


d,r(x) = i[p )i ,r(x)}, 


(3.14) 


where is the four-momentum operator. If the current is conserved, one can 
obtain, by integrating over all space, 

dO i 

= = 0, (3.15) 

with H = Po, which is the Hamiltonian. Therefore, it turns out that Q l commute 
with Hamiltonian H and the states with Q l (i — 1,2, • ■ •, n 2 — 1) degenerate with the 
same energy so as to make a multiplet. Using the canonical commutation relations, 
one can easily show that the charges Q l satisfy the following relation, which is the 
Lie algebra, 

[Q i ,Q j ) = ifijkQ k , (3.16) 
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since the generators T l of SU (n) always satisfy the same Lie algebra, 

[T i ,Ti] = if ijk T k , (3.17) 

where fijt is the structure constant of the Lie group and totally antisymmetric with 
respect to i, j and k. That is, the charges Q l satisfy the same Lie algebra as the 
generators T l of the group SU(n) do. Furthermore, for an infinitesimal #”(<C 1), 
one can derive the following relation, using canonical commutation relations of (3.4) 
and (3.5), 

[Q i ,<f>\ = -T i <t>. (3.18) 

This relation shows that Q l is the generator of an infinitesimal transformation 

<f> -> <t>' = e iffiQi <j)e- i9iQi = e~ i9iTi 0, (3.19) 

as can be seen by comparing the following relation 

e ieiQi (j>e- ieiQi =<f> + i6 i [Q i ,cf>] + 0(6 2 ), (3.20) 

with 

e - i0iTi 4> = 4>-iO i T i <l> + O(O 2 ). (3.21) 

In summary, the Lagrangian is a fundamental object describing the dynamics 
among elementary particles. It has various symmetries which are represented by 
transformation groups. The Lagrangian should be constructed to be invariant under 
those transformations. The invariance of the Lagrangian under a global transforma¬ 
tion specified by a unitary group results in a set of conserved charges Q l (Noether’s 
theorem) and these charges become a generator of the Lie algebra of the group. 


3.2 Local Gauge Symmetries and Gauge Fields 

The properties of strong, weak and electromagnetic interactions appear to be un¬ 
related at low energies. For example, they have quite different coupling constants. 
However, there is a possibility that at some extremely high energy the coupling 
constant may converge to a single value and that interactions among elementary 
particles could be explained in terms of single unified field. Glashow, Weinberg 
and Salam made a major breakthrough along the path to unification by unifying 
the weak and electromagnetic interactions. The most significant step in this direc¬ 
tion is the realization that all fundamental interactions are invariant under local 
gauge transformations and the hope is that gauge theories will provide a basis for 
a comprehensive unification of fundamental interactions. 

The standard model of particle physics refers to three quantum gauge theories 
which describe electromagnetic, weak and strong interaction of elementary particles. 
All these theories are renormalizable and are based on certain symmetries, which 


~Pufuc. %a£/uuHa£uroZ PAy.lrc.i. 



Local Gauge Symmetries and Gauge Fields 


37 


are classified into two ways, (1) manifest or unbroken and (2) hidden or sponta¬ 
neously broken. The electromagnetic interaction is unbroken local 1/(1) symmetric, 
the weak interaction is spontaneously broken local 517(2) x 17(1) symmetric and 
the strong interaction is unbroken local SU (3) symmetric. The non-Abelian gauge 
theories of weak and strong interaction is the generalization of quantum electrody- 
namics(QED), which is the Abelian local U( 1) gauge theory. 


3.2.1 Quantum Electrodynamics-U (1) model- 

Let us begin with the Lagrangian C describing the field of a single free fermion with 
mass m. In the natural unit (ft = c = 1), it is expressed as 

C(x) = t}(x ) - m ) ip(x). (3.22) 

from which the Dirac equation is obtained by taking variation for x ); 

- m)ijj(x) = 0, (3.23) 

where tp(x) is a 4-component spinor field of the fermion at point x in four dimen¬ 
sional space-time and d^ = d/dx* 1 (p = 0,1,2,3). 7 M are the 4x4 Dirac matrices 
and the summation over p is implied. It is understood that m in the mass term is 
multiplied by a 4 x 4 unit matrix I. 

First consider a global transformation for this system. As discussed in the 
previous section, it is described by a phase change of the fermion field 

il>'(x) = e~ l6 ij}{x) = Ugip(x). (3.24) 

The Lagrangian (3.22) is invariant under this phase transformation where 6 is a 
constant everywhere in space-time. That the Lagrangian is invariant under the 
global transformation means that the phase 6 of the fermion field is not observable. 
Clearly, transformation (3.24) is unitary, i.e. U^Ug = 1 and it is Abelian, i.e. 
Ug 1 Ug 2 = Ug 2 Ug l . That is to say, transformation (3.24) is global, unitary and 
Abelian. This transformation is denoted by C7 (1). 

Now, let us study what happens when this global symmetry is made local. It is 
well known that the electromagnetic dynamics has a local gauge symmetry larger 
than a global symmetry. A local gauge transformation is defined as 

ip'(x) = e~ ie ( x V(z), ${x) = e ie ^tp{x), (3.25) 

by introducing a ^-dependent phase parameter 6{x) varying locally from point to 
point in space-time. Then, the kinetic energy term is no longer invariant under this 
transformation because an additional term being proportional to d fl 6(x) appears 
from the derivative of the transformed fermion field. The new Lagrangian under 
this transformation becomes 

C £ — / ip'(x)(ij ,1 df 1 - m)ip'(x) 
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= - m)ip(x) + ip(x)'f fl 'ip(x)d tl 8(x) 

= C + j^(x)d^8(x), (3.26) 

where j fl (x) — ip(x)'Y fi ip(x) is the vector current carried by the fermion. Hence, the 
Lagrangian is not invariant under the local transformation. Only if d^8(x) = 0, i.e. 
8(x) is independent of x, £ is invariant. Then, if we make the following replacement 
in (3.22) 

dfj, - ieA^x), (3.27) 

by introducing some vector field A we get instead 

£->£ + ejM„, (3.28) 

where e is the electromagnetic charge of the fermion which is negative for an electron. 
Thus, if we define a new Lagrangian by replacing in (3.22) by of (3.27) 

C{ip,ip,A^) = - m)ip 

= ^(z7 m (9 m - m)ip + exji'y ll il>A ll , (3.29) 

then this £ becomes invariant under (3.25) if at the same time we make the replace¬ 
ment 

-* A 1 ^ = Ay, - ^d^8{x) or SA^ = -i<9 M 0(a;), (3.30) 

which precisely cancels the additional unwanted term in (3.26). _D M defined above is 
called a “covariant derivative”, a terminology borrowed from the general relativity. 

From the above observation, one can prescribe the procedure for getting the 
gauge invariant Lagrangian in a little different way; starting from an original La¬ 
grangian (3.22) which possesses a global symmetry and making the replacement 
(3.27), we can get a new Lagrangian 

£ = - m)ip, (3.31) 

which is invariant under the local gauge transformation by requiring 

{D^{x))' = e -" ( »)U M V»(*) or S(D^) = -i8(x)D^(x). (3.32) 

The requirement (3.32) leads to (3.30), where 8(x) is the ^-dependent infinitesimal 
phase parameter. Then, electromagnetic dynamics is made invariant by introducing 
a spin 1 vector (gauge) boson field A M called the photon through the covariant 
derivative, which is called the “minimal coupling”. It is very important to know that 
in the gauge invariant theories the interaction between gauge bosons and particles 
(fermions and/or bosons) is uniquely determined only through the minimal coupling. 

The Lagrangian which is invariant under local U( 1) gauge transformation is, 
therefore, given by (3.31). However, this is not the complete Lagrangian for de¬ 
scribing the whole system. We have to add the kinetic energy term and mass term 
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of the electromagnetic field A^{x), which should be also gauge invariant. As for the 

mass term, the local gauge invariance necessarily leads to massless photon because 
2 

the mass term z^-A^A 1 * violates the local gauge invariance, unless m = 0. It is 
well known that the kinetic energy term of the electromagnetic field is given by 
— \F I1V F ,1V , where the electromagnetic field-strength tensor is defined by 

- d v A„. (3.33) 

The coefficient — | is necessary for the requirement that the Euler-Lagrange equa¬ 
tion is just the Maxwell’s equation. The kinetic energy term is obviously gauge 
invariant because SF^„ = 0 under the transformation (3.30) as easily checked by 
directly introducing (3.30) into (3.33). It is also seen in a little different way; the 
covariant derivative satisfies 


[D m , D v \ ip = -ieF^xp. (3.34) 

From (3.32), it follows that ([D^^D^tp)' = D v \ ip) and thus F^ip 1 = 

e~ i6( - x ' ) F flv tp = F^ip 1 , that is, 

Therefore, the complete gauge invariant Lagrangian for the system of an electron 
and a photon takes the following form, 

Cqbd = ^(x)(* 7 m £V - m)iP(x) - -^F^F^. (3.35) 

The field theory with Cqbd is called quantum electrodynamics(QED). The gen¬ 
eralization of classical electrodynamics to describe quantum effects had culminated 
in the development of QED as the theory of interaction of photons and electrons. 

In QED, the electromagnetic interactions are mediated by massless photons. 
Photons are quanta of the electromagnetic fields, being massless with no charge and 
with spin 1, and they do not interact amongst themselves. Theory of gauge fields 
represents a class of theories which share and generalize characteristic properties 
of the Maxwell’s theory of electromagnetic fields. There are two characteristics of 
Maxwell’s theory: (1) Forces created by gauge fields, i.e. photons, are long-ranged 
and obey inverse square of distance as realized as Coulomb force. (2) Force is 
proportional to the quantum number of its source and that quantum number, i.e. 
the charge, is conserved. 

Typical physical processes due to the electromagnetic interactions are shown 
in Fig. 3.1: (a) a fermion emitting or absorbing a photon, (b) annihilation of a 
fermion-antifermion pair to produce a photon, (c) a fermion scattered by another 
fermion in a two step process in which a photon emitted by the first fermion is 
absorbed by the second. 

The principle of gauge invariance has come to be recognized as most powerful 
guiding light for our understanding of not only QED but also probably all interac¬ 
tions. 
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(c) 

Fig. 3.1 


3.2.2 Yang-Mills Gauge Theory-SU(2) model- 

In 1954, Yang and Mills (Yang and Mills, 1954) proposed that the 17(1) gauge theory 
of electromagnetic interactions could be generalized to the non-Abelian gauge the¬ 
ories which are invariant under any non-commutative continuous symmetry group. 
They chose the isospin of 517(2), which was familiar at that time. Therefore, the 
original Yang-Mills non-Abelian gauge theory is an extension of the 17(1) Abelian 
gauge theory of QED to the internal isospin SU (2) symmetric theory. The formal¬ 
ism can be easily generalized to more general cases of 517(n) with n > 3. 

We have seen that the electromagnetic fields are described by an antisymmetric 
tensor which is made of a vector potential A M . Now we add an isospin index 
to A m and and ask the following question; Under what condition is a theory 
invariant under a space-time coordinate and isospin dependent phase transforma¬ 
tion? 

Likewise in (3.25), one can write an ^-dependent 517(2) transformation (phase 
rotation) as 


il>(x) —»■ ip'(x) = l/y>(x), (3.36) 

U = (3.37) 

where the summation over i is implied, g is the coupling constant associated with 
the group 517(2). t 1 (i = 1,2,3) are three 2x2 Pauli matrices and 6 l (i = 1,2,3) 
are three real parameters corresponding to three generators t*/2, (i = 1,2,3) of the 
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group SU(2). 


t 1 ri 

T’T 



(3.38) 


where the summation over k is implied. £ijk, being the totally antisymmetric Levi- 
Civita symbol with £123 = 1, is the structure constant of the group 517(2). Obvi¬ 
ously these three generators do not commute, i.e. SU( 2 ) is non-Abelian, hence the 
gauge theory in question is called non-Abelian, ip is both a spinor and an isospinor. 
Thus, instead of a single fermion field, we start with a doublet Dirac field 


^ = (tA $2), 

^0 = ^ 7 °- («= 1 , 2 ) 


Originally, Yang and Mills chose ip = 


P 

n 


doublet where p and n denote the 


proton and neutron, respectively. However, one can also choose ip = 


u 

d 


doublet 


as well where u and d denote the up-quark and down-quark, respectively. Here, the 
coupling strength g is a real constant to be determined from experiment (analogous 
to e in QED). We insist that the Lagrangian is invariant under transformation (3.36) 
and accordingly we encounter a number of additional complications, due to the fact 
that there are now three orthogonal symmetry motions which do not commute with 
one another. To construct a Lagrangian that is invariant under (3.36), we must 
again define a covariant derivative that transforms in a simple way. In analogy of 
QED, it is natural to introduce three independent gauge field potentials acting in 
different directions, namely, A^(x) for i = 1,2,3 and require that operation on ip is 
only through the covariant derivative 


- igA^, (3.39) 

where = 5Zi=i is a matrix-valued gauge field. Likewise in QED 

discussed above, the covariant derivative should satisfy 

(D^ipy = D'^ip' = U(D^iP), (3.40) 


which leads to 


d; = ud^u- 1 . 


Then, if we demand for A M to transform as 


A' li = UA lt U- 1 --(d lt U)U-\ 


(3.41) 


(3.42) 
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the covariant derivative (3.39) satisfies the expected transformation property, 
(D^ipy - (d^-igA'JUip 

= (d^UW + Ud^ip - ig UA^U- 1 - -(d^U- 1 U4> 

= U{d^ -igA^) 

= U(D^). (3.43) 

For an infinitesimal transformation, 

U = 1 -i6 + 0{6 2 ), 


U- 1 = l + i0 + O(6 2 ), 

(3.44) 

where 9 = X)i=i Y®' ~ 2 ' ^ (3-42) results in 


Aj, = A n~i 6, A p - 

(3.45) 

Using (3.38), we obtain 


K = A i + e ijk e j A l ;-- g d lt e i , 

(3.46) 

or 


S. < = e iih e j A* - fa*. 

(3.47) 


To get the gauge field-strength tensor like the electromagnetic field-strength tensor 
of (3.33) and its transformation rule under an infinitesimal transformation 
(3.44), we generalize (3.34) as 


[D m , D v ] ip = -igF^ip, (3.48) 

where F= ]T)f = i = f with the gauge field-strength tensor F^ v defined 


Fl„ = A£ - 0„Af, + ge ijk A^A k v . 

(3.49) 

From the relation 


([D^DW = U{[D lt ,D v ]1,) i 

(3.50) 

which follows from (3.40), one can easily derive 


F= UF^ip 

(3.51) 

or 


F^ = UF fll/ U~ l , 

(3.52) 
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which leads to the following transformation relation of the field-strength tensor 
under an infinitesimal transformation (3.44), 

SF;„ = e ijk VF k „. (3.53) 

Note that F* v transforms nontrivially unlike the case of QED where F^ is invariant 
because of no structure constant in an Abelian group. In analogy of QED, we 
construct the kinematic term for the gauge field A J, as 

~^Tr(F M „ • F MI/ ) = -\ £ Tri^F^F^) = -\f^ (3.54) 

i,j =1 

where the 2nd equality is due to the relation Tr(y-y) = One can easily see 
that this term is invariant under the gauge transformation as shown in the following; 

6{Fl v F^ v ) = 2 SF^F 1 ^ 

= 2 eijkVF^F^ = 0, (3.55) 

where the last equality is due to the totally anti-symmetric property of £ij k . There¬ 

fore, we can adopt (3.54) as the kinetic term for the gauge field A*. 

In summary, we can write down the gauge invariant Lagrangian for a fermion 
with mass m in the SU (2) symmetric world as 

C = C F + C G , (3.56) 

Cf = - m)ip(x), 

c G = -iiv(fv-F n=-\n»F i,iv , 

where C F and Cg are represent the Lagrangian for the fermion and the kinetic 
term of gauge fields, respectively. The covariant derivative is defined by (3.39) with 
3 gauge fields A ’ (i — 1,2,3). Transformation property of the gauge field A 1 ^ is 
given by (3.46) or (3.47). The gauge field tensor F^ v is defined by (3.49) and its 
transformation rule under 517(2) gauge transformation is given by (3.52). Change 
under an infinitesimal transformation (3.44) is given by (3.53). In (3.56), the mass 
term of gauge fields does not appear again because it violates the gauge symmetry. 
Some comments are in order for (3.56). (1) Contrary to QED, the kinetic term 
of gauge fields Cg contains 3 gauge boson interactions with the same coupling 
constant g as the one of the gauge field to fermions. It contains also a 4 gauge 
boson interaction with the coupling g 2 as follows; 

C G = -\d^Ai(d^A iv - d v A^) - ge iik A\A? v d»A hv - 9 ^e ijk e ilm A^A k v A^A™. 

(3.57) 

(2) The Yang-Mills theory, i.e. the non-Abelian gauge theory is not the free theory 
even without matter fields because it contains self-interactions among gauge fields. 
This is very different from the Abelian gauge theory like QED, where there is no 
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self-coupling of photons. (3) As long as we demand the gauge invariance alone, we 
can add renormalizable Yukawa interactions such as, 

Cy = Gyip{x)i(>(x)^(x) (3.58) 

in which the scalar field (j)(x) is introduced for Cy to be invariant under the gauge 
transformation. There is no principle to determine the form of the Yukawa term 
except for the requirement of gauge invariance. Gy is also totally unconstrained; 
there is no other way to determine it from experiments. 

Unfortunately, this theory (3.56) is not useful for weak interaction because it 
gives identical coupling to right- and left-handed fermions and leads to the parity 
conservation. To preserve gauge invariance, it is essential to have massless gauge 
bosons A 1 ^, which should give weak interactions of infinite range like electromagnetic 
case, being contrary to the real physics. Only if the gauge symmetry is broken by the 
inclusion of mass term, it becomes possible to achieve agreement with experiment. 


3.3 Spontaneous Symmetry Breaking and Goldstone Bosons 

Nature seems to possess various types of symmetries such as geometrical (Lorentz 
invariance, parity and time reversal invariance etc.) and internal (isospin, flavor 
and color, etc.) symmetries, discrete and continuous symmetries and so on. Some 
of them are exact symmetries and others are approximate symmetries. Here we are 
concentrated on the symmetry and its breaking in the physical world. In the field 
theory, dynamics of the physical world is described by the Lagrangian. There are 
two ways to discuss the symmetry breaking in the field theory; (l)One way is to 
add a symmetry breaking term to the symmetric term by hand as 

£ = G 9 ynx d* ^breaking • (3.59) 

Examples are seen, for example, for the case of an approximate symmetry such as 
the isospin 517(2) or flavor 51/(3) symmetry and so on. This type of symmetry 
breaking is useful when the symmetry breaking term is small and the perturbative 
treatment is meaningful. In this case, C recovers the exact symmetry for vanishing 
Cbreaking ■ However, this case is rather artificial because there is no fundamental 
principle to determine the exact form of Cbreaking ■ (2) Another way is the one called 
a hidden or a spontaneous symmetry breaking(SSB), where the Lagrangian remains 
symmetric under certain group transformation while the physical vacuum is made 
non-invariant. It is well-known that there are many examples of the SSB both in 
classical and quantum physics such as bent rods under a strong force, infinite fero- 
magnets, crystal lattices, superconductors, etc. Here we are interested in the field 
theory model of the SSB. 


• Goldstone theorem 
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Let us consider the U( 1) global invariant Lagrangian composed of a complex 
scalar field cj)(x ) and <j)*(x), 

+ (3.60) 

The system is equivalent to the one described by the following Lagrangian composed 
of 2 real fields 951 and 922 which are related to </> and </>* as (j) = (fi + if 2 )/V^ and 
<£* = (<Pi - iip2)/V2, 

£ = ^d^id^fpi + ^d li ip 2 d IJ, tp2 ~V(f\ + ip\). (3.61) 

(3.61) has the 0(2) symmetry, that is, the Lagrangian of (3.61) is invariant under 
the following 0 ( 2 ) transformation, 


( Vh ^ f ‘Pi \ = f cos0 -sin0 A / ipi \ 

V <P2 J V ¥>2 / V s[nS cos# ) \ <P 2 ) 


(3.62) 


and one can see that the t/(l) symmetry is equivalent to the 0(2) symmetry. The 
potential V(<f>*4>) = V(<p 2 + 9 o\) must have the following properties for the theory 
to be meaningful; V is (1) at most the 4th order of the fields in order to ensure 
renormalizability of the theory and ( 2 ) bounded below for change of ((f)*(ft) 1 / 2 ~ \<f>\, 
so that the theory has a stable ground state. A typical example of V is given in 
(3.60) with A > 0. 

In quantum field theories, particle excitations of a field are defined as quantized 
fluctuations of the field about its lowest energy state, i.e. the vacuum state. The 
constant value of the field corresponding to the lowest energy state is called the 
vacuum expectation value (VEV), i.e. (0|<£|0) = <f> 0 . To find the particle spectra, 
we expand the potential about its minimum corresponding to the lowest energy 
state as 


V(ipi,<p 2 ) — V{<fQi,ipo2) + ^ ( Vj- ) (‘fa - fOa) 

0=1,2 \ 0i Pa/ 0 


+ ; E 


a,6=1,2 


d 2 v 

d(p a dip b 


(fa ‘PQa)(‘Pb fob) "t" ' ' ‘ , (3.63) 


where <j> 0 = ( 9201 ,¥> 02 ) is the VEV of cf> = ( 921 , 922 ), i.e. foa = (0|9?a|0) (a = 1,2). 
Since the potential V has its minimum at <p = (f> 0 , the 2nd term of the r.h.s. of 
(3.63) is zero. The factor ) = m 2 b in the 3rd term is called the mass 

matrix which is diagonalized to generate the particle spectrum. 

Now we have 2 possible cases of V. (1) One is the case where the vacuum is 
unique and is called “Wigner phase”. In this case, only one vacuum state (9201 = 
0,9202 = 0) is realized as shown in Fig. 3.2(a). Let us take the parameters m 2 and 


"Pufuc. TtlxLtll-€!J*L€Vtl.C-Cll. 



46 


SYMMETRIES AND THE GAUGE THEORIES 


vm v(\ v \) 



(a) (b) 


Fig. 3.2 


A for the potential in (3.60) to be positive, m 2 > 0 and A > 0, 

V(<p\ + ¥>l) = + fl) + ^(¥>i (3-64) 

and require the following condition for the vacuum, 

= m 2 tpoi + \<poi(tpoi + <£ 02 ) = 

0 

(J^Q = m 2 ip 02 + A^o2(<^oi + ¥> 02 ) = 0. (3.65) 

Then, we are led to the unique vacuum ¥>oi = ¥>02 = 0. The mass matrix becomes 
diagonal in this case 




(3.66) 


which means that ipi and tpi have the same mass m as already seen from (3.64). 
(2) Another one is the case where the vacuum is not unique and called “Nambu- 
Goldstone phase”. This case is realized, for example, for continuously or infinitely 
degenerated vacuum states with ¥>01 i=- 0 and/or ¥>02 7 ^ 0, as shown in Fig. 3.2(b), 
for the potential with m 2 = — r? (h 2 > 0) and A > 0, 

V(ip\ + (fil) = -^-(¥ 3 i + ¥> 2 ) + ^(¥’1 + ¥^) 2 - (3.67) 

The minimum of V is realized by requiring 

(%r) = -pVoi + + ¥>02) = 0 ) 
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( — ) = 

\d<p2j o 

which leads to the condition 


~/x 2 <p 02 + A</>o2(</ 5 oi + Vm) — 0) 


¥>oi + </4 = v 2 = y or (<t>*4>)o = l<^o| 2 = Y = 2X' 


(3.68) 


(3.69) 


In other words, all points on a circle with radius u = ^/p 2 /A in the (^ 1 ,^ 2 ) plane 
correspond to the minimum of V, that is, the vacuum state is no longer unique but 
is 0(2) symmetric. One can choose any point as the physical vacuum. From (3.67), 
we obtain 


d 2 V 

dpi 

d 2 V 

d 2 V 

dipidip2 


— (-/x 2 + \{<pl + <^ 2 )) + 2A<p 2 , 

= (—/x 2 + A(<p 2 + <p|)) + 2A</? 2 , 


(3.70) 




Then, if we choose a point {‘fioi = w, <po 2 = 0) as the physical vacuum, we obtain 
the mass matrix as 


™lb = 


2Aw 2 0 
0 0 


(3.71) 


Therefore, we find that <p'i — ¥>i ~ u corresponds to a massive particle with mass 
m 2 = 2Aw 2 , while <p' 2 = ip 2 is massless. ip' 2 ls called a “Goldstone boson”. Actually 
using these new fields, we can rewrite the Lagrangian (3.61) as 

C = \{d^ i) 2 + i(a M ^) 2 - i(2Au 2 )^ 2 + \v(p[((f 12 + rf) ~ + tf?- (3-72) 

This Lagrangian (3.72) has no longer 0(2) symmetry, though the original La¬ 
grangian (3.61) has it explicitly. That is to say, the symmetry of the original 
Lagrangian has been broken by breaking the symmetry of vacuum. This is called a 
hidden symmetry or spontaneous symmetry breaking(SSB). 

In summary, starting from the Lagrangian which has a global symmetry, taking 
the negative parameter m 2 = — p 2 (p 2 > 0), and breaking the symmetry of the vac¬ 
uum states by choosing a particular point among symmetrically degenerate vacuum 
states, we found that a massless particle, called the Goldstone boson, appeared. 
This mechanism is called the Goldstone theorem. (Goldstone, 1961; Goldstone, 
Salam and Weinberg, 1962; Bludman and Klein, 1962) In general, when a global 
symmetry is spontaneously broken, a number of massless Goldstone bosons appear 
depending on the symmetry properties. 


• Useful parametrization for the t/(l) model 
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In literature, another and more useful parametrization of the field is often used. 
Introducing the two real fields p(x) and 0(x), we can write the complex field 0( x) 
in (3.60) as 

<t>(x) = ±p(x)e ie ^/\ (3.73) 

where v is a constant given by (3.69). Then, substituting 

d li< t>=±e ielv (d fl p+ i -pd li 9), (3.74) 

into (3.60), we have 

C = \(d,p) 2 + ^P 2 M 2 - V{p 2 ). (3.75) 

To find the mass of the particle corresponding to the excitation of the would-be 
radial field p{x), we expand it as p{x) = v + r](x) and obtain 

c = \(d,v) 2 + \(d„6) 2 + lv(d„0) 2 + ^ 2 (d,0) 2 - V(p 2 ), (3.76) 

with 

V(p 2 ) = \{2p 2 W + W + ^ - \p 2 v 2 . (3.77) 

It is noted that there is no quadratic term of 0 in this Lagrangian. From this La- 
grangian, one can easily find that we have a massive r] field with mass m n = \[2p 2 
and a massless Goldstone boson 6. 


• Extention to the SU( 2) model 


<t>\ 

02 


, composed of 


In the SU (2) model, the field is given as a doublet, 0 = 

2 complex fields, 0i = {tp\ + i<p 2 )/V% and 02 = + *</?4)/\/2- Then, the global 

517(2) invariant Lagrangian is given as 


C = dptfn-V (3.78) 

V = -p 2 ^<f> + H^4>) 2 , (3-79) 


with p 2 > 0 as before. Now, by using 4 real fields H(x) and C( x ) = 1,2,3), we 
write 0 as 


0 = 


_l_p*T'4*(a)/2« 

V2 


0 V 

v + H(x) ) ’ 


(3.80) 
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where the VEV v is defined as </?qi + ^02 + ^03 + ^04 — v 2 = ‘j- or {$<f>)o = |0o| 2 = 
Y — 2 \- Then, putting 


d 6 — — e iri C/ 2v I ( ® ^ 4 . -~Q £ ,: ^ ® 

d ^~ V2 H dpH ) + v 2 W 


v + H 


(3.81) 


into (3.78), we get 


£ - i|(0 v + H)- 


, r 
"2 


x {(a4) + ^(„ + ° ff )}- v '«” + ff fl- < 3 ' 82 > 


which leads to 


£ = ^Hd^H + —d^eiv + H) 2 - V ((« + F) 2 ) , (3.83) 


using the relation r‘r^ = <5 IJ +ieijkT k . Prom this Lagrangian, one can see that the 3 
fields £ l (z = 1,2,3) have no mass terms. They are massless Goldstone bosons, while 
F is massive with mass mu = v^M 2 - The original SU(2) symmetry disappeared. 
It is remarkable to note that the number of Goldstone bosons is equal to the number 
of generators breaking the symmetry of the vacuum state. In an example of SU( 2) 
model, the number of Goldstone bosons are three, (i = 1,2,3), corresponding to 
the 3 generators, r* (i = 1,2,3). 

This is generalized for cases with larger symmetries. Now, let us consider an 
SU (n) symmetric world with n component fields, 


( ¥>i \ 
<P2 


(3.84) 


\ / 

The SU(n) group has N — n 2 - 1 generators T l (i = 1,2, ■ • ■, N) and under this 
group, <p transforms as follows; 


cj) 0 = e - ir7 "<t>, (3.85) 

or under an infinitesimal transformation, 

S(t> = ~i6 i T i ^>, (3.86) 

where the summation over i is implied. T l are the matrices with nxn components. 
Suppose that the vacuum is not unique but is symmetrically degenerate. Then, 
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choose a particular vacuum state, 




<Po — 


\ *Pn J q 


(3.87) 


Under an infinitesimal transformation, a change of this vacuum state is S(j) 0 = 
-i6 l T l <p o- Then, one can see that the vacuum is not invariant under operation of 
generators T l if T'^o 7 ^ 0. Suppose that M(< N) generators T l (i = 1,2, • • ■, M) 
are such generators, while other T l (i = M + 1, M + 2, • • •, iV) leave the vacuum 
invariant as T l 4> 0 = 0. Then, by introducing a new field <j>' = (j> — <f>o, we expand 
the potential V around the vacuum as before, 


—o + g(£) /i + i| i (^) / > i + .., <3.SS) 


with 


= 


\9<PaJo 


= 0, and m 2 b - 


d 2 v \ 

dipadpb) o' 


(3.89) 


The potential should be invariant under SU(n), i.e. SV = -§^-5ip a = 0 which, 

by using (3.86), results in 


OT T 

£ q^T>» = 0- for * = 1,2, 


a,b 


Then, by differentiating this equation by ip c , we obtain 

9 2 V T i , v-' 9V_ i _ 

\d<p c dVa ^ aC ’ 

a,o a 

which leads to, at the vacuum <p = (j) 0) 

E«(^)o=0, for i = 1,2, •••,7V. 

a,b 


(3.90) 


(3.91) 


(3.92) 


As described above, TVo i 1 0 for i = 1,2, • ■ ■, M. Therefore, T l <p 0 span a M- 
dimensional subspace in the n dimensional space spanned by < j> of (3.84) and thus, 
m 2 ca has M zero eigenvalues. In other words, there exist M Goldstone bosons or to 
each generator which breaks the symmetry, a Goldstone boson appears. 

For more detailed discussion on the spontaneous symmetry breaking and Gold¬ 
stone bosons, see, for example, reviews by Abers and Lee (Abers and Lee, 1973) or 
Coleman (Coleman. 1975). 
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3.4 Higgs Mechanism 

In the previous section, we discussed the spontaneous symmetry breaking under the 
global symmetry, i.e. the Goldstone theorem. Here we go further by extending the 
global symmetry to the local gauge symmetry. 


• The 1/(1) model 


First, let us consider an example of the scalar electrodynamics for a scalar com¬ 
plex field <t> — i + iy> 2 , which is local (7(1) invariant. The Lagrangian is given 
by 

£ = -\F^F^ + {D^Y(D*t)-V{4r4>), (3-93) 

V{<P*4>) = -/*V0+W*) 2 , (3.94) 


with ft 2 > 0, where F^„ = <9 M A„ — A M is the field-strength tensor for photon 
fields A M and D^cj) is the covariant derivative D^(f> — (d M — ieA^cp as before. The 
Lagrangian (3.93) is invariant under the following 17(1) gauge transformation, 

cj) -i <// = e~ ia ( x) <j>, (3.95) 

Afi -> A' lt = A lt - is M a(x). (3.96) 

Now let us minimize the potential V as before with \<fi 0 \ 2 — ^ Then, if 

we parametrize the field 4>{x) as 


cj ) ( x ) = ±( v +r 1 (: r))e" ( *)/”, 


(3.97) 


where rj(x) and 9(x) are real fields. We can define a new set of fields by taking a 
particular gauge transformation with a(x) = 0(x)/v, which is called the “unitary 
gauge”, 


4>{x) 

Ap(x) 


-> 4>(x)' = e-^W/Xx) = ~(v + 7 7 (*)), 

-> B^x) = A M (z) - — d^dix). 


Under this unitary gauge transformation, we have 


D n<t>{x) -t D’^'ix) = (d„ - ieB^)~(v + rj{x))) 

and 


(3.98) 

(3.99) 


(3.100) 


Ffj,, /(A) — dfjA u — di/A M —> Fu,„(B) — d^B^ — d v B^. (3.101) 
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Here we can easily show F fil/ (B) — F lll/ (A) by substituting (3.99) into (3.101), that 
is, the field-strength tensor F^ is gauge invariant as it should be. Substituting 
(3.98) ~ (3.101) into (3.93), one can rewrite the Lagrangian as follows; 

£ -ieB^v + r))] 2 - y(u + r ?) 2 - j(v + n) 4 ~ \f )iv {B)F> 1v {B) 

= \dvVd>r, - hW - \f^(B)F^(B) + |(e«) 2 B M B" 

+ + 2v ) - A vrf - j 7 ? 4 . (3.102) 

We can see that this Lagrangian describes a massive vector boson B with mass 
ms = ev and a massive scalar rj with mass m n = \/ 2 /r 2 . 77 is called “Higgs boson”. 
Here we have no Goldstone boson which has gone out of the Lagrangian (3.102). 

In summary, by extending the symmetry of the Lagrangian from the global to 
local one, we found that the massless Goldstone boson 9 disappeared and a massive 
gauge vector boson B and a massive scalar boson rj called Higgs boson came out. 
This is called the Higgs mechanism. (Higgs, 1964, 1966; Englert and Brout, 1964; 
Guralnik, Hagen and Kibble, 1965; Kibble 1967) The Goldstone boson 9 was eaten 
up by the gauge boson B and became the longitudinal component of it. It is re¬ 
markable to note that in the Higgs mechanism the degree of freedom is conserved, 
that is to say, starting from 2 real scalar fields (<^ 1 ,^ 2 ) or (rj, 9) plus 2 polarization 
states of massless photons A M , we finally got one real massive scalar field tj and one 
massive vector boson B fl with 3 polarization degrees of freedom 


The 517(2) model 


Let us next consider the non-Abelian 51/(2) model by generalizing the U( 1) 

<t> 1 


model. We have again the complex doublet field <j> = 
invariant Lagrangian with SU (2) symmetry is given by 


<t> 2 


£ - {D^)\D^) - \Fi u F^ - V(i 


Then, the gauge 


(3.103) 


with 


= 


F l = 


1,2,3) 


( d » ~ *0 ( 
d^Ai - duA^ + geijkA^A*, 


= -j 1 2 ^ + A(^) 2 . (m 2 >0) 


(3.104) 

(3.105) 

(3.106) 


Now, introducing the new real fields H(x ) and C( x ) (* = 1; 2,3), let us parametrize 
the field <j>(x) as 


V2 


(j>{ x) = — 7 =e 


v + H(x) 


(3.107) 
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Then, by taking the unitary gauge as before, we define the new fields as 
m -4 m = u(x)<j>(x) = ± w+ ° H(x) ), 

-4 = U(x)A fl U~ 1 - 


(3.108) 

(3.109) 


with 


U(x)=e~ iTi * i ( x)/2v , (3.110) 

where the summation over i is implied. This transformation leads to 

D„4, (D„«' = 0„-i^B')i( tJ + ^ l) ), (3.111) 

F^(A)F^(A) -4 F; iy (B)F i ^(B) = F^ I/ (A)F i ^(A), (3.112) 


with 


KAB) = W - d„Bi + ge ijk BiB k v . (3.113) 

Then the Lagrangian becomes 

C = - \fI v {B)F^ v {B) + /iW) - A(4>V) 2 - (3.114) 

Prom this Lagrangian we can see that the three Fi x ) (* = 1,2,3) fields disappeared. 
Where did these fields go? We can find the answer by writing £ in terms of the 
component fields of <j>' . First, let us write down the covariant derivative term, 

[(£>^)'] to (D^)' a = \d,Hd»H + g*BiB^ (y) (y) *Vo 

= Id^Hd^H+^-BiB^iv + H) 2 . (3.115) 

2 o 

Then, we finally obtain the following Lagrangian, 

£ = ^d„Hd^H - ^ 2 H 2 -^F^(B)F i ^(B) + ^-BlB i ^ 

+ ^B i ll B i ^H{2v + H) - A vH 3 - ^H 4 - j. (3.116) 

A triplet of massive vector fields B * (i = 1,2,3) with mass ms = § gv and a single 
massive scalar, i.e. Higgs boson H with mass m# = y/2g? appeared. We found here 
again that the Goldstone bosons F (i = 1,2,3) were eaten by the gauge bosons B l 
(i = 1,2,3) to make their longitudinal components. This is the Higgs mechanism in 
the non-Abelian SU (2) gauge theory. It is again interesting to note that the number 
of degrees of freedom is conserved in the Higgs mechanism, i.e. the 3 Goldstone 
bosons F (i = 1,2,3) become the longitudinal components of the respective gauge 
fields which lead to appearance of 3 massive vector bosons B l (i = 1,2,3). The 
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discovery of the spontaneous symmetry breaking (SSB) and the Higgs mechanism in 
the non-Abelian gauge theories made a great breakthrough toward the unification 
of electromagnetic and weak interactions, which will be discussed in detail in the 
next Chapter. 


Problems 

3.1 Show that when the generators T l satisfy the Lie algebra (3.17), 

[T\Ti] = if ijk T k , 

the “charge” Q l also satisfy the same Lie algebra (3.16), 

[Q\Q j ) = ifwQ k , 

where Q l is defined by (3.13). 

3.2 Using the canonical commutation relations of (3.4) and (3.5), prove the relation 
(3.18). 

3.3 In the SSB of the global 0(2) symmetric theory described in 3.3, discuss what 
happens if we take the vacuum to be (<poi = yj, <Po 2 = instead of (y? 0 i = v, 
(f 02 = 0), and show that both cases are physically equivalent. 

3.4 Derive (3.116). 

3.5 Consider the global 0(n) symmetric Lagrangian, 

£ = ^dn<t> T (x) -d^cp- V{<i) T ■ 4>), with V{<t> T •</>) = ^-<P T ■ <P + ^{4> T ■ <P) 2 , 
where 

/ <Pi \ 

>P2 

<t> = 

\ <Pn / 

After the spontaneous breaking of this 0(n) symmetry by taking m 2 = —/i 2 (/r 2 > 
0), show that one can make the vacuum to be invariant under 0(n — 1) transfor¬ 
mation, that is, 0(n) breaks into 0(n - 1). Furthermore, show that the number of 
Goldstone bosons is equal to the number of broken generators of the original 0(n) 
group. 





Chapter 4 

THE STANDARD MODEL OF 
ELECTROWEAK INTERACTIONS 


In this Chapter, we describe the Glashow-Weinberg-Salam(GWS) model of elec- 
troweak interactions (Glashow, 1961; Weinberg, 1967; Salam, 1968). It is a non- 
Abelian gauge theory with SU{2)l x U (l)y gauge symmetry accompanied by the 
Higgs mechanism. It is the first successful model toward the unified theory of ele¬ 
mentary particle interactions. The model is extremely successful in particle physics 
phenomenology without serious discrepancy with almost all existing data; only one 
exception at present might be an evidence of massive neutrino established with re¬ 
cent observation of neutrino oscillation. The discovery of W ± and Z° bosons with 
expected masses and a weak neutral current mediated by a massive neutral vector 
boson Z° is a great triumph of the model. Here we first consider the model for 
one lepton family of an electron e and the corresponding neutrino v e . Then, the 
realistic case of more lepton families is described. Extension to the quark sector 
is also discussed, where the non-diagonal quark mass matrices and the Cabibbo- 
Kobayashi-Maskawa(CKM) matrix are introduced. These embody the standard 
model of the electroweak interaction. 

4.1 Fermions in the GWS Model 

Let us start with the discussion on the one family of leptons, an electron e and 
its neutrino v e . v t is considered to be massless in this model, while e is massive 
with a small mass, m e ~ 0.5MeV. Both of them have spin 1/2. In this section, we 
assign these particles to the appropriate representations of SU(2)l x t/(l)y gauge 
symmetry, the gauge symmetry of the GWS model. 

As we have seen in chapter 2, in the weak processes such as 


fi ->• 

e 

+ P e + Vp, 

(4.1) 

7r~ —>■ 


+ 

(4.2) 

n -*> 

p + e +v e , 

(4.3) 


only left-handed leptons and right-handed anti-leptons take part in and the decay 
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amplitudes of these processes can be written down in terms of charged currents 


x) = Jp(a;) + = P e L(a;)7M e i( a; ) 


= ^e(x) 7 ^( 1 - 7 5 )e ( a;), 

(4.4) 

Jn(x)* = J M (x) _ = e L (a;) 7 M J/ e L 


= ^K x )ln.{ l -Ah)v e { x )- 

(4.5) 


This suggests that ez, and v e L could be arranged to simply make a doublet associated 
with SU(2) group. Let us introduce a lepton doublet composed of the left-handed 
components of fermions, 


L = 


(1 - 75 ) ( v, 


(4.6) 


Then, by using this doublet and 2x2 matrices in the so-called weak isospin space 

r 1 +ir 2 


r + 


' 1 -c .V 2 / 0 1 


T 1 - IT 2 


0 0 

0 0 
1 0 

one can rewrite the charged currents (4.4) and (4.5) as follows; 

J- = L^t-L. 


(4.7) 

(4.8) 


(4.9) 

(4.10) 


(Note that in this Chapter r’ are used to denote Pauli matrices, in order to make 
clear that they stand for the weak isospin, not the ordinary spin.) These forms 
suggest that the weak currents make an SU ( 2 ) group by introducing an additional 
neutral current, 

Jl = L^L 

= \velnv e - iejr, 7 M e L . (4.11) 

Then, we have 2 charged and 1 neutral currents, and J 3 , which couple to 
the weak bosons W± and A 3 ^, respectively, just as the electromagnetic current 
J® m (x) couples to photon A fl (x), as we will see later. Existence of and J 3 
suggests that we have weak isotopic triplet currents of SU( 2), belonging to the 
adjoint representation, 

J i ll (x) = Lr ril T i L = Lj ll jL, (* = 1 , 2 , 3 ). ( 4 . 12 ) 
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It is easily shown that the corresponding charges T 1 (i = 1,2,3) defined by 

T = J 4(x)d 3 x, (4.13) 

satisfies the SU(2)i algebra 

[T\T j ] =ie ijk T k , (4.14) 

with e ljk being the totally antisymmetric Levi-Civita tensor with e 123 = 1. In 
(4.12) the subscript L of SU(2)l means that the weak isospin currents are composed 
of only the left-handed Weyl fermions. In all of these currents, the right-handed 
component of electron e R does not appear. It has no interactions with any other 
particles and thus it should be a singlet under SU(2)l transformation, i.e. 

R = ^(! + 75)e = e R . (4.15) 

As mentioned above, in this Chapter we assume v e to be massless and thus v e has 
no right-handed component. At a first glance, one might think of identifying the 
neutral current J 3 of (4.11) with the electromagnetic one, J® m . But it is impossible 
because of the following reasons; (l)the neutral current J 3 (x) has no right-handed 
component, while the electromagnetic current J^ m (x) for an electron has both left- 
handed and right-handed components as follows, essentially because parity is a good 
symmetry in the electromagnetic interactions; 

= -e7 M e = -e L 7M e i ~ Srj^cr, (4.16) 

(2)the “charge” of neutrino which couples to A 3 is not zero but opposite to the 
one of electron as shown in (4.11), while it should be zero for the electromagnetic 
interaction, and (3)contrary to the electromagnetic current, the neutral current J 3 
maximally violates parity. These arguments suggest that gauge symmetry SU(2)l 
should be enlarged. Let us note that, though (4.11) leads to a wrong relation saying 
that the sum of electric charges of u e and e~ vanish, due to the property Tr r 3 = 0, 
it gives the correct difference of the charges, i.e. | - (— |) = 1 — 0, This implies that 
should have a piece generated by a diagonal 2x2 generator with non-vanishing 
trace. Namely the gauge group should be enlarged so that it contains a new 17(1) 
symmetry. Therefore, we also need another gauge field B M associated with the new 
17(1) symmetry. This new U{ 1) group should be independent of SU(2)l group and 
thus its generator should commute with the generators of SU(2)l, T l ( i = 1,2,3). 
The gauge group is thus extended to the direct product of SU(2)l and 17(1). Then, 
how can we practically realize it? We want to keep the form of the electromagnetic 
current for a fermion ijj with charge Q to be 

(4.17) 

where Q is the charge for the fermion ip. For the case of an electron, the eigenvalue 
of Q is Q = -1. From (4.17), we can define the generator of 17(l) em for an electron, 
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lepton family 

Q 

(T, T 3 ) 

y 


0 

(!,+§) 

-l 

eioPL, t l 

-l 

(J.-i) 

-l 

eR,HR, T R 

-l 

0 

-2 


Table 4.1 Quantum number assignment of lepton families in the GWS model. 


i.e. the charge of an electron as 

Q = J J(x)o m d 3 x = - J ejoed 3 x — - J (e^ez, + e^ R eji)d 3 x. (4.18) 


(The reader should not be confused by this notation, though the same Q as in (4.17) 
is used here.) However, since this generator Q does not commute with T l defined 
by (4.13), U (l)em and SU(2)l cannot be simultaneous symmetries of the model. 
Then, we look for a new U( 1) symmetry, called U (l)y, so that Q is given by a 
linear combination of the generator T 3 of the SU (2) i group and the generator of 
this new C/(l)y group. In doing so, it is interesting to note the relation 


Q ~ T3 = J d3x - \ e L e L ~ , (4-19) 

which shows that each element in an SU (2) l doublet (v e e) l L has the same eigenvalue 
— | and the eigenvalue of e# is —1. Furthermore, Q — T 3 commutes with T l (i = 
1,2,3), i.e. [Q — T 3 , T l ] = 0, that is, Q — T 3 and T l can be simultaneous symmetries 
of the model. Thus, it is reasonable to define this new generator of the U(1)y group 
as y = Q — T 3 or 


Q = T 3 + (4.20) 

4U 

The eigenvalue of Y is called weak hypercharge; the name of this quantum number 
is originated from the fact that (4.20) is of the same form as Nakano-Nishijima- 
Gell-Mann relation (Nakano and Nishijima 1953; Gell-Mann 1953), established in 
1950s in successfully describing the hadron classification, in which Y was called 
hypercharge. These quantum numbers for an electron e and its neutrino v e are 
summarized in Table 4.1. (In this table, the quantum numbers of muon and tau 
families are also given,) 

In summary, the GWS model is the SU(2)l x U(1)y gauge theory and we have, 
as matter fields, a left-handed doublet L and a right-handed singlet R of 517(2) 
group which are represented as 


L = 



R = eR. 


(4.21) 
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The model starts with the Lagrangian constructed with L and R, which is invariant 
under the direct product of SU(2)l and U (l)y groups: 

SU(2) L : L' = e- iai{x)I rL, R^R' = R, 

U(1) Y : L -4 L' = e^L, R -4 R' = e Wx) R, (4.22) 

where a 1 (i = 1,2,3) and 0 are group parameters for weak isospin and weak hyper¬ 
charge operators, respectively. Since we are considering the local gauge invariant 
Lagrangian, a 1 and 0 are ^-dependent. 


4.2 SU(2)l X U(1)y Invariant Lagrangian 

In the previous section, we introduced the representation of an electron and its 
neutrino with respect to the SU(2)l x [7(l)y symmetry, which is given in (4.21). 
The gauge invariant Lagrangian with STJ(2)l x f/(l)y symmetry for these fermions 
is constructed as 

C F = Li^idfj-igy A^ + ^g'B^L 

+ Ri'f(d ll + igfB ll )R, (4.23) 

where A l ^(i = 1,2,3) and B^ are gauge boson fields associated with SU(2)l and 
17(1)y, respectively, g and g' are the gauge coupling constants corresponding to 
SU(2)l and I/(l)y, respectively. Here the explicit forms of the covariant derivatives 
for L and R come out from the general form 

Bg, ~ dn ~ ~ ~2 (4.24) 

by taking account of Y = —1 for L and Y = — 2 for R, respectively (see Table 
4.1). R is a singlet of SU (2) F and hence does not couple to A 1 . In (4.23), the 
fermion mass term, which connects L and R fields, does not appear because it 
violates SU(2)i x [7(l)y invariance. Therefore, all fermions, i.e. an electron and 
its neutrino, are massless at this stage. 

The kinetic term of the gauge fields which should be added to C F is given by 

Co = - \b^\ (4.25) 

with 

F lu = Q^Ai-dyA^ + geakA^Al, (4.26) 

Bf, v = d^By - d v Bn, (4.27) 

where F^ v {i = 1,2,3) and B fiv are field strength tensors of gauge fields correspond¬ 
ing to SU(2)l and t/(l)y, respectively. As is well known, the mass terms of these 
gauge bosons do not appear because of the local gauge invariance. 
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In order to make fermions and gauge bosons massive and make the real world, 
we need spontaneous breakdown of gauge invariance, i.e. the Higgs mechanism, 
described in the previous chapter. Since we are living in the f/(l) em symmetric 
world with massless photon, we need to have the following breaking, 

SU{2) l x U (l)y —t U(l) em . (4.28) 

To realize this symmetry breaking, we introduce the scalar fields, called Higgs 
bosons, which give rise to the Higgs mechanism. Since we start with 4 gauge 
bosons(3 associated with SU (2) i and 1 with I/(l)y) and finally want to have 1 
massless photon associated with U(l) em , we need scalars with at least 4 degrees of 
freedom. The simplest example of such scalars, which is called the minimal model, 
is an SU( 2) doublet of 2 complex scalar fields whose weak hypercharge is Y# = +1, 

<A = ( £o ) > (4-29) 

where y> + and ip° are positively charged and neutral complex scalar fields, respec¬ 
tively. The Lagrangian for these scalars is given by 

C S = {D^)\D^)-V{tfcj>), (4.30) 

with 

= (dp — *<?— • Afj, — —g 

where the explicit form of the covariant derivative is due to 
term V(tf<j>) being gauge invariant is given by 

v(tf<t>) = m 2 tf<f> + \(tftf 2 , (4.32) 

where m 2 and A are real constant parameters. A should be positive to ensure the 
stable vacuum. Further higher power terms of tf<f> are not allowed in order for the 
theory to be renormarizable. 

We can also add the coupling terms between fermions and scalars, called Yukawa 
interaction terms, which are SU(2)l x U(1)y gauge invariant and are to provide 
the electron mass after the spontaneous symmetry breakdown, 

£y = -G e {L(j)R + Rtf L) + h.c., (4.33) 

where G e is called Yukawa coupling constant and cannot be determined within the 
GWS model itself. One can easily check the SU{2)l x U (l)y invariance of £y using 
the values of hypercharge of L, R and <t> defined above. 

A full set of SU(2)i x U(1)y gauge invariant Lagrangian of the GWS model is, 
thus, given by the sum of pieces presented above, 

£ = £f + £g + £« + £y- (4-34) 


(4.31) 

= +1. The potential 
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Strictly speaking, the gauge fixing term with gauge parameter £ (of gauge”) 
and the associated term for Faddeev-Popov ghosts also should be included, in the 
quantization procedure of the theory. (See, for example, the books by Peskin and 
Schroeder (Peskin and Schroeder, 1995) or Pokorski (Pokorski, 2000).) 


4.3 Spontaneous Breaking of SU(2)l X [7(l)y Symmetry 

As shown in Fig. 3.2(b), the potential V(0t0) of (4.32) with positive A and negative 


m 2 = — /i 2 (/x 2 > 0) has a minimum at the value of 0 determined by 

2 / 2 ~ 

0 + 0 = I0| 2 = y, with v = y y. (4.35) 

Then, as described in Chapter 3, spontaneous symmetry breaking occurs when the 
scalar doublet <j) of (4.29) develops a vacuum expectation value 

<h = <O|0|O) = ( ) • (4.36) 

It should be noted that though T 3 and Y do not annihilate the vacuum 0 O 

T ’*° = KJ -1 ) ( »/°V2 ) - -5*' (4 - 37) 

Ycj> o = 0o, (4.38) 

and thus they are broken generators, i.e. e~' la ' i ' r '<po ^ and e _t/3 2 <f> 0 ^ </> 0 , the 
electric charge operator Q is not a broken generator, 

0*, = (T» + !)*,= (J o)(„ / °^)=0. (4.39) 


i.e. e~ ieQ cf>o — 00 ) where e is an arbitrary parameter. Therefore, even after the 
symmetry breaking, there remains a symmetry associated with the charge operator 
Q of U{ l) em being compatible with our real world. 

Now, it is convenient to parametrize the scalar doublet with 4 degrees of freedom 
in terms of the fields denoting the shifts from the vacuum state 0o, 

Here the original 2 complex scalar fields ip + and <p° in (4.29) are replaced by 4 real 
fields, £i(i = 1,2,3) and H, where & are so-called Goldstone bosons being absorbed 
into the longitudinal components of W ± and Z° bosons as described later and H is 
a Higgs boson. (4.36) leads to zero vacuum expectation values for all of these fields 

(0|£i|0) - (0|i?|0) = 0. (* = 1,2,3) (4.41) 
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Here we can rewrite the Lagrangian in the “unitary gauge”, where 3 Goldstone 
bosons & disappear by being ’eaten up’ by gauge bosons, W ± and Z°, and thus, 
physical particle spectra and their interactions become apparent. By applying the 


unitary 517(2) transformation 

17(0 = e~ i? ^ 2v , (4.42) 

one can come to the real world induced in the unitary gauge. Then, we can define 
the new fields in our real world as 

= ( („ + h)/V2 ) = 72 ( ” + H)X ' <“- 43 > 

L 1 = U(0L, (4.44) 

K = UiOA^Utf)- 1 ~ fauiWHO, (4-45) 


with x = ^ i ^ and A M = A M ■ where the new fields transformed from the 

original ones are presented with a prime. R and remain unchanged under this 
517(2) transformation, 

R 1 = R, ( 4 . 46 ) 

Bl = B». ( 4 . 47 ) 

The Lagrangian is invariant under this transformation and one can rewrite each 
piece as 

C F = L'i^{d»-ig T --A^ + yB'JL'+ R'iY{dv + ig'B l , l )R', ( 4 . 48 ) 

Bg = ( 4 . 49 ) 

C s = (D^nD^)' -V(<t>'W), ( 4 . 50 ) 

By - -G e (L'cf>'R' + R'<t>'iL') + h.c. ( 4 . 51 ) 

Now let us discuss the physics described by this Lagrangian realized in the 

unitary gauge. First we consider the scalar sector. The scalar fields generate masses 
of gauge bosons and those of quarks and leptons via the Higgs mechanism. B s is 
explicitly written as 

B s = (D^)'(D^y ( 4 . 52 ) 

with 

(D^y = (d, - ig T - • - '-g'B'^' 

= + ( 4 . 53 ) 
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The first term of (4.52) contains the mass-squared term for weak gauge bosons 
which is originated from the quadratic terms of gauge fields as shown in the follow¬ 
ing, 

C mass = + + 

= J {g 2 ^ ■ A'» + g' 2 B'^ - igg'B'^) 

= £(s 2 4M'i , ‘ + 9 2 A'ZA i2 » + [gA« - g'B'J 2 ), (4.54) 

where in turning from the 1st line to the 2nd line, we used the formula r l r J = 
6{j + ie l i k T k . Now let us introduce charged boson fields W ± defined by 

w t = (4-55) 


Then the sum of the 1st and 2nd terms of (4.54) can be written as \g 2 v 2 W+W 
It means that the charged vector bosons W ± are massive with the mass 


M w = ^gv- 


(4.56) 


The remaining term which is described by neutral fields can be written as 


8 


« K) 


-99 


-99 

„'2 


B't* 


(4.57) 


which can be diagonalized into 

j(Zp A,) ( g2 + 9 ' 2 o ) ( ^ ) = + 9 n )Z^ + 0 • A,A\ (4.58) 

by an orthogonal transformation 
Z, 


M _ 


cos<?iy — sin^vv 
sin 6w cos Ow 


A* 

B'„ 


(4.59) 


where Ow is called the weak mixing angle or Weinberg angle. The diagonalization 
leads to 


tan 6w = 


(4.60) 


or 


sin Ow 


cos Ow — 


. - - - wu v L/y - . -— . 

V 9 2 + 9 a vV + g' 2 

From (4.58), we see that the neutral Z boson becomes massive with the mass 


(4.61) 


M z = ^vy/g 2 + g' 2 , 


(4.62) 
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and another neutral boson A M is massless and hence can be identified with the real 
photon. Note that in the GWS model the mass of Z° boson is related to the one of 
W ± bosons as 


M z 


M w 

cos 6 W ' 


(4.63) 


One can see that the masses of W ± and Z° are quite large; in fact by using (4.83), 
(4.84) and (4.89) below, we can estimate them to be 


M w 


M z 


1 1 e 2 

2 9V ~ 2^s/2G f 
76 


M w _ 

cos 6w ~ sin20w 


)l/2_J_ ^ 38 

sin 9w sin ffw 
GeV > 76GeV. 


GeV > 37GeV, 


(4.64) 

(4.65) 


The values of Mw and Mz are obtained if sin#vv is determined experimentally. 
Actually, the value of sin 2 Ow is obtained in experiment to be around 0.23, leading 
to Mw — 80GeV and Mz — 90GeV. 

The potential term (4.32) of scalars becomes, after symmetry breaking, 


V(<j)'W) = -y ( v + H ) 2 X*X+^( V + h ) 4 (x'x ) 2 

= + A vH 3 + (4.66) 

Prom (4.66), we see that the mass of the physical Higgs boson H can be identified 
with 

M h = ^/V, (4-67) 


whose value cannot be predicted, based on some principle, in the GWS model. After 
all, in the unitary gauge the Lagrangian C s results in (up to a constant term) 


c s = (D^nD^y - v(*V) 

= Id^Hd^H - \m 2 h H 2 - A vH 3 - yff 4 


+ 


£(H 2 + 2Hv) 


—L-Z^Z" + 2 w+w- 1 * 

COS 1 &w 


+ M^W+W-v + -Mlz^zr (4.68) 

Let us consider next the Yukawa interaction term Cy in the unitary gauge 


Cy = 


—G e (L'cj>'R' + R'^L') + h.c. 

~ G e + H ) e R + S ' R ^ V + H ' )e ' L ) + hc ' 


G e v _i i G e 


y/2 


e e 


V2 


He'e'. 


(4.69) 
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Since only physical degrees of freedom appear in the unitary gauge, we can identify 
e' to be physical electron and thus we suppress the prime ' hereafter. Then, the first 
term of this equation corresponds to the mass term of an electron with the mass 


G e 

m e — —7=0. 

V2 


(4.70) 


It is interesting to note that the electron mass is proportional to the vacuum expec¬ 
tation value v of the Higgs boson as well as in the case of weak gauge boson masses. 
The second term shows an interaction term of an electron to the Higgs boson with 
the coupling constant 


G g trie 

V 


(4.71) 


It is remarkable that the coupling constant is proportional to the electron mass. 

In fact, the Higgs coupling to fermions in the Feynman rule (see Appendix 
B.3) is given by = -t g = -i{y/2G F ) ll2 m s from (4.56), (4.89) and 
(4.83), where Gf is the Fermi coupling constant and m/ is the mass of the fermion 
coupled to a Higgs boson. Since v ~ 246GeV (see (4.85)), the Higgs coupling to 
ordinary quarks ( u, d, s, c, b) or leptons (i/ e , e, i/ M , p, u T , r) is extremely small and 
one can neglect the Higgs interaction effects on these particles. However, if there 
exist extremely heavy fermions such as the top quark, 4-th generation quarks ( Q ) 
or leptons (L), the Higgs coupling to those particles becomes large and cannot be 
neglected. (Inazawa and Morii, 1988; Strassler and Peskin, 1991) It might be even 
larger than gluon coupling to those extremely heavy quarks. 


4.4 Charged and Neutral Currents, Comparison with Effective 
Fermi Theory 

The fermion part C F of (4.48) in the unitary gauge is written as 

C F = VvfdyJJ + R'iYdpR' + gJp • + 9 - B'», (4.72) 

where 

J, = V^-V, (4.73) 

Jp = -L'^L' -2R'^R'. (4.74) 

In (4.72), the 1st and 2nd terms result in the conventional kinetic terms of an 
electron and its neutrino 

L'i-y^d^L' + R'i^d^R' = ei^d^e + (4.75) 

where the prime for electron and neutrino fields is dropped in the right hand side. 
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Fig. 4.1 


From (4.72), one can pick up the charged current interaction which is given by 
£cc = + JlA' 2 *) = + J+W + n, (4.76) 

where 


J± = Jl±iJl- (4.77) 

J± are explicitly expressed as 

= J l+ iJ l = L'^t + L' = v eL 'y ll e L - ^P e 7 „(l - 7 s)e, (4.78) 

= J l ~ iJ l = = SL-y^i/eL = ^e7 p (l - 75 )^- (4.79) 


Now, it is interesting to compare these charged currents with current x current 
interactions in the effective Fermi theory discussed in Chapter 2. Let us consider 
the lowest order process of v e + e -A u e + e in the low energy, which occurs through 
an exchange of W ± boson (Fig. 4.1). The corresponding Feynman amplitude is 
given by 


which reduces to 



q 2 - My, + it 


J- 


M = —i 


-S. _ J+fi 

2 mV 


l w 


(4.80) 

(4.81) 


in the limit of q 2 /M^ -> 0. The same amplitude can be obtained from the effective 
Fermi theory as 

C e M = • 4J + %- = (Pe7 M (l - 75) e ) (e 7M (l - 75 K) • (4.82) 
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Therefore, one can see that the Fermi theory of weak interactions is the low energy 
effective theory of the GWS theory. Comparing (4.81) with (4.82), we obtain 


G f _ g 2 
y/2 8 My 


By using 


G f = 1.16639 x 10~ 5 Gey“ 2 


(4.83) 


(4.84) 


and (4.56), (4.83) and (4.84), one can determine the vacuum expectation value of 
<f> as 

v = (V2Gf)~* ta 246 GeV, (4.85) 


which is called weak scale. 

From (4.72), by using (4.59) one can also extract the neutral current piece 
C NG = gJlA l3 » + 

J Y 

= {gsm.6 w Jl + g'cos6 w -^-)A il 

J Y 

+ (g cos & w Jl ~ g' sin(9iy (4.86) 

Recalling (4.20), we have 

= + • ( 4 - 87 ) 

By using this relation, the current coupled to A 11 in (4.86) becomes 

JY 

gsmOwJl. + g' cos6\y-^- = g' cosOw + {gsinOw - </cos0w)J 2 . (4.88) 

The second term vanishes from (4.60). The first term can be identified with the 

interaction of an electron to photon and hence we can identify the coupling constant 
to be electric charge e, i.e. 

e = g'cos9w = gsin9w, (4.89) 


or 


1 1 
— — r + 


e 2 g 2 g‘ 


1_ 
,/2 ' 


(4.90) 


The current coupled to becomes 


J, 


gcos6 w J 3 -g'sin0 w -%- =--x—(«/£ - sin 2 0 w J em ), (4.91) 

Z COS U\y 
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and thus we define a new neutral current coupled to as 

4 = 4-sin 2 0 w j; m . (4.92) 

In general, if a fermion family is given by 

L =( f f) ’ Rf = /*. Rf ' = fk> ( 4 - 93 ) 

J t f is explicitly written, by using (4.11) and (4.17), as 

4 = 4 - sin 2 e w j; m 

= ~ sin2 6 w{Qfhnf + Q'fJ'inf) 

= aifLlph + a f R fulfil + a{ f' L ^f' L + a f R f' R j^f' R , (4.94) 


where Q/, Q'j are electric charges of /, /' (in the unit of e), respectively, and 



a{ = ^-QfSin 2 e w , a{ = - Q) sin 2 9 W , 

(4.95) 


a R = —Qfsin 2 0w, a R = —Q'fSm 2 6w- 

(4.96) 

Or, J z is often written in literature as 



■4 = hArt - c f A i6)f + IWC - cZrif, 

(4.97) 

with 

c v = \(a f L + a R) = \-Qf s[n2d w, 

(4.98) 


C A = \( a L- a R) = \> 

(4.99) 


c v = \( a L + a R) = -Q'fSin 2 e w , 

(4.100) 


Ca = 2 ^ -a ^ = _ 4' 

(4.101) 

Similarly as in (4.81), the lowest order amplitude for the low energy 
current processes, e.g. v e + e v e + e, shown in Fig. 4.2, is given by 

neutral 


M = i 1 ,,,/., 4J Z ». 

2! M| cos 2 9w * 

(4.102) 


where the factor ^ comes from the fact that is the square of the identical 

current. Thus, the corresponding effective Lagrangian for neutral current processes 
can be written as 

^ = -^• 4 ^, ( 4 . 103 ) 
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e e 

Fig. 4.2 


defining ^ = 8M a ^os 2 e w • Hence, we see that the total effective Lagrangian at low 
energy can be described by 

£ eff = ■ 4(J+J-" + (4.104) 

where 


Gn = Ml 
G f M| cos 2 


(4.105) 


It should be noted that p = 1 is the inevitable consequence of the fact that the 
Higgs scalars belong to the doublet representation of SU(2)l■ It is no longer the 
case for Higgs scalars with higher dimensional representations of SU(2)l, such as 
triplet. To see this explicitly, let us consider the case with higher representation of 
Higgs multiplet. Then, we can only replace the Higgs doublet 0 in C s of (4.52) by 
a new Higgs scalars <j>' h (x) of higher multiplet. The gauge boson mass term can be 
written as well as in (4.54) 


= fxlGtffc • 4 + • 4 + g'^BJxh, (4.106) 


where Th and Yh are appropriate generators corresponding to SU(2)l and U (l)y, 
respectively, and Vh is the vacuum expectation value of this Higgs scalar. Xh rep¬ 
resents a similar vector as in (4.43), in which only one nonzero component is 1 and 
others are zero. For this multiplet we have again Qh = 7 ),3 + ^j*-, as before, where 
Qh is charge of h and 7 /, 3 is the 3rd component of ?),. Now, let us put Q h = 0 
for one component of <l>' h {x). For this component, we have ^ = - 1 ^ 3 . Then, the 
vacuum expectation value Vh of this field generates the masses of gauge bosons, ex¬ 
cept the photon, in the same way as the case of the doublet Higgs scalars described 
before. The resultant mass term is 


V f x{ [ffh • • A"* + + 2gg'f h ■ A'^B'^ Xh 
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r » 2„,2 


^(A'ZA'^Th + A!^A'^T 2 h2 ) 


+ (tfA'^T* 3 + - 2^TZ 3 A«B'» 

Z \ 9 9 


g 2 v 2 


„ 2„,2 


h ( i a/2 ^/2/xrp2 , 9 r?2n->2 

- {A » A I >«) + 2cos‘e w Z ' tlh3 ' 


(4.107) 


where in the second and the third lines, Xh^hiXh (i = 1,2,3) has been written 
simply as T^. Prom this equation, we have 


P = 






h± 


Ml cos 2 e w J2 h u 2 T 2 3 ’ 


with 


TL = \(T 2 hl + T 2 h2 ) = \{f 2 h - T 2 3 ) = l (4(4 + 1) - T 2 h3 ) , 


(4.108) 


(4.109) 


where 4 is the (magnitude of) weak isospin of h. Therefore, p can take any value, 
in principle. When all Higgs multiplets belong to the same representation, as in the 
case of the supersymmetric extension of the standard model, i.e. the “Minimal Su¬ 
persymmetric Standard Model” (MSSM), Y2i v h cance l out between the numerator 
and the denominator of (4.108) and we have 


1 I h (I h + 1) - T 2 3 
P 2 T 2 3 


(4.110) 


One can easily check p = 1 for 4 = \ and T h3 = ±|, as in the case of the standard 
model or MSSM. In other words, the GWS model with minimal Higgs doublet 
necessarily predicts p = 1 at the classical (tree) level, though the p deviates from 1 
at the quantum level, as we will see in Chapter 8. 


4.5 Addition of More Leptons 


So far we have considered only one lepton family, i.e. electron e and its neutrino u e . 
The GWS model can be easily extended for the case with more lepton families. Let 
us first consider, for simplicity, the case of 2 lepton families, electron (1st generation) 
and muon (2nd generation) families, represented as 



(1st generation) (2nd generation) 


(4.111) 


The Lagrangian for this fermion sector is given by 

C F = C i ; ) +CP, (4.112) 
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where both terms have the same expression as (4.23) by replacing L and R with 
those given in (4.111), respectively. The Lagrangian for gauge field and scalar boson 
sector remains unchanged. However, as for the Yukawa interactions, new terms are 
allowed without contradicting with the requirement of SU(2)i x C/(l)y invariance, 

Ty — GppRGepLefyRn Gnf>Lji(f)Rf> T h.c. (4.113) 


Notice that here we require only the local gauge invariance and do not assume 
separately the electron number and muon number conservation to start with. After 
the spontaneous symmetry breaking, one can easily obtain the fermion mass term 
from Cy just as we did before for an electron family. 


= -(m ee e L e R + + m etl e L ft R + m^ftLeH + h.c.) 


= ~{e. L vl) 


m e 

m, 


fie 


m 

m u 


efi 


)( 


e R 

Hr 


+ h.c. 


(4.114) 


where my = Gij (i,j = e or ft). 
Note that the mass matrix 


M = 


m ee 

m^e 


m 

m 


efi 

fifi 


(4.115) 


is, in general, complex and not necessarily hermitian and is not diagonalized by use 
of one unitary matrix. In general, it can be mathematically shown that an arbitrary 
n x n complex matrix M can be diagonalized by a bi-unitary transformation as 


MV = M d , 


(4.116) 


where M d is a diagonal matrix and both of U and V are unitary matrices, which 
are the unitary matrices to diagonalize the hermitian matrices MM t and M^M: 
W(MM^)U = (M^M)V = M d M%. Redefining the phase of n and e fields 

appropriately, one can always make the elements of M d be real and positive as 

U<MV = M„=(“- m °J, (4.117) 

where m e and correspond to the masses of electron and muon, respectively. 

To follow this program, let us diagonalize the M of (4.115) by applying the 
unitary matrices, 


/ cos 9 l -sin#/, \ v — ( cos A 

\ sin#L cos Ql J ’ \ sin 0 R cos 0 R ) ' 


(4.118) 


To be precise, each of U and V may have a few phases, in addition to the mixing 
angle, 6l or 0 R . These phases, however, turn out to be physically irrelevant (see the 
discussion in the subsection 4.6.2 below). Thus here we simply ignore those phases. 
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Then the £j-" iass ' 1 can b e rewritten as 

4 m “” ) = -(c L A L)M( e *)+ h .c. 

= -(e' L p! L )M d ( e f )+h.c. (4.119) 

V Vr J 

where 

( <4 ) = [ft ( « ) = f ) ( « ) , (4,120) 

\ Hl / \ Hl J \ -sin 0 L cos 6 L J \ hl J 

( '? ) = W ( eR ) = ( C “ 9 ? Sln9 / ) ( « V (4.121) 

V Hr J \ m J \ — sin 6 R cos8 R J \ HR J 

Notice that the fields with prime (e' L R , h'l,r) which are called mass eigenstates are 

not equal to the fields without prime (bl,Ri Hl,r) which are called current/weak 
eigenstates. The observed states are mass eigenstates. Therefore, when we write 
weak charged currents interaction in terms of the observed states, the interaction 
is no longer diagonal in the generation space and we potentially have an intergen- 
erational mixing. The detail will be discussed later in section 4.6 when we discuss 
the quark sector. 

However, here we just point out that in the case of massless neutrinos the mixing 
angle 6i can be rotated away and thus, the mass eigenstates of leptons become equal 
to weak eigenstates. This can be seen considering the lepton doublet (4.111). By 
using (4.120), the weak eigenstates are written as 


( V e \ = ( U eL 

v e ) L \ cos8 L e' L - sm8 L n' L 


W = 




V^L 

sin QLe' L + cos Olh'l 


(4.122) 


As is easily shown, under the following rotation between the lepton doublets 


/ L e \ / L' e \ _ / cos a sin a \ / L e \ 
\ Lft ) \ L 'n ) V -sina cos a J \ J 

we have 


(4.123) 


Ui'fDaLt + LJ'fDaLu = L' e i 7 “£> a L' 4- L'^D^. 
Then, if we choose a = 6 l, we can write 


L' 


L' 


cos 6 L,L e + sin = 

— sin 0lL(. + cos = 


cos 8 L v e L + sin Olv^l 


■ sin O^eL + cos OlVuL 

A 


(4.124) 


(4.125) 

(4.126) 
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Here v e and are degenerate in their masses, as they are both massless, and 
therefore we can redefine the observed neutrino fields as 

v' eL = cosO L v> eL +smO L u^ L , (4.127) 

= -smd L v eL +cosQl,Vp,l, (4.128) 

and so we end up with the weak doublets, which are still diagonal in the base of 
mass eigenstates. The mass eigenstates are now written as 

L ' e= ( U e ') L ’ (4 ' 129) 

and there is no flavor mixing between electron and muon in the weak charged 
current interactions. Thus, if we define individual lepton numbers as n e (electron 
number) = +1 for e'~, v' e \ n e = -1 for e ,+ , v' t and n M (muon number) = +1 for ft'~, 
n p = — 1 f° r M ,+ > respectively, then it is naturally derived that both of the 

electron number n e and the muon number are conserved separately. The GWS 
model automatically leads to the conclusion that the decay mode ^ -4 e ± + 7 is 
strictly forbidden. It is remarkable to note that the flavor conservation was not put 
in by hand but originated from the massless nature of the neutrinos in the GWS 
model. In other words, if neutrinos are massive, one can no longer redefine the 
neutrino fields as in (4.127)-(4.128) and the weak eigenstates of neutrinos are given 
by a linear combination of the mass eigenstates v\ and 1 / 2 , which leads to neutrino 
oscillations, as we will see in Chapter 6 . (In the quark sector which will be discussed 
later in section 4.6, the mixing angle cannot be rotated away because quarks are 
massive with different masses.) 

Let us next discuss the Higgs boson couplings to electrons and muons. From 
(4.113), one can extract them in the unitary gauge as 

Lmi = -~(e L ft L ) ( mee m ^\( e *\+ hx . ( 4.i30) 

V V m pe J V m ) 

When we go to the mass eigenstate by using (4.120) and (4.121), Chu turns into 

Chu = (el/4)M d (*£ 

= -^(m e e'e' + m^fi'n'). (4.131) 

This shows that (i) Higgs couplings do not mix the lepton flavors and (ii)they are 
proportional to the masses of leptons, which couple to the Higgs. 

In conclusion, there is no lepton flavor number violation in the standard GWS 
model; the electron number n e and muon number are strictly conserved. Exten¬ 
sion of the model to tau lepton family (y T , r) is straightforward and the tau number 
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n T is also conserved. At present we have 3 generations of leptons; 



(4.132) 


In the standard GWS model, n e , n^, and n T are separately conserved. Therefore, 
the processes such as /*“ -> e~ 7 , fi~ -> e + e~e~ and fi~ + Z -A e~ + Z, etc. are 
forbidden at the all orders of perturbation in the GWS model. Therefore, if some 
of these processes are discovered, it will clearly signal the presence of some physics 
beyond the standard model, such as MSSM. 


4.6 Extension to Quarks 


The well-established observation of left-handed charged weak currents of hadrons 
suggests that the left-handed components of quark fields should be constructed into 
a doublet similarly to the case of leptons. Now we have the following 3 generations 
of quarks. 


(1st generation) 

(;),■ 
/ \ 

ur, 

dR, 


(2nd generation) 

(:).• 

cr, 

SR, 

(4.133) 

(3rd generation) 

( 1 ^ 
Uv 

*r, 

bR, 



The GWS model can be easily extended so that it can incorporate these quark 
families, though there are several differences between quarks and leptons; (i) quarks 
have three color degrees of freedom, whereas leptons are colorless. However, since 
the electroweak interactions are color-blind, one can simply suppress the color index 
of quarks in the GWS model, unless otherwise mentioned, (ii) all quarks are massive, 
while neutrinos are considered to be massless in the GWS model. Therefore, we 
have 2 right-handed singlets such as ur and d,R for each generation, compared to 
the case of lepton families where we have only one right-handed singlet such as 
eR. Thus, the Lagrangian must contain an additional term for the up-type right- 
handed singlets like ur, in order to form their mass term, (iii) charge of quarks are 
different from that of leptons and thus, keeping the relation, Q — T 3 + y, weak- 
hypercharges of quarks are also different from those of leptons. The charge Q, weak 
iso-spin (T, T 3 ) and weak-hypercharge Y of these quarks are summarized in Table 
4.2. 

To write the Lagrangian for quarks in a compact form, we introduce the following 
notation 


QLi = 



Urn, Dm, (* = 1,2,3) 


(4.134) 
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quark family 

Q 

(T, T 3 ) 

Y 

UL,CL,tL 

+ 2 
‘ 3 


+5 

dL,SL,bL 

1 

3 

(§,-§) 

+ 1 

UR,CR,tR 

+ 2 
' 3 

0 

+1 

dR,SR,bR 

1 

3 

0 

2 

3 


Table 4.2 Quantum numbers of quark families. 


where i stands for the generation: U\ = u, {/ 2 = c, U 3 = t, D\ = d, £>2 = s, D 3 = b. 

Based on Table 4.2, we can explicitly write down the Lagrangian for these quarks 
with covariant derivatives as 

3 

C F - ^2 QuiYidf, - ig ^ • A M - ^g'B^Qu 

i— 1 

3 9 

i=l 

3 1 

+ + (4.135) 

i =1 

The terms for gauge fields Co and scalar fields C s remain unchanged from the case 
of leptons. As for the Yukawa coupling, we can write the following SU(2)l x C/(l)y 
invariant terms 

C Y = ~ X)(r ^Ql^Drj + r $Q Li 4URi + h.c.) t (4.136) 

hj 

where and are Yukawa couplings which are, in general, unconstrained 
complex parameters. <p whose weak-hypercharge is —1, i.e. — —1, is defined 
from 4 > as 

j> = ir 2 cj>* = ( j , (4.137) 

where <j>* is the complex conjugate of 4> given by (4.29). Inclusion of (p in the 2nd 
term of (4.136) is allowed because the fundamental representation and its conjugate 
are equivalent in 51/(2) group (see Appendix C3), though this is not the case for 
any other SU(N) groups with N > 3. This may be easily seen as follows; Since <jAtj> 
is SU(2)l invariant, so is = (jtcp* = ^ f (—iT 2 )(iT 2 <)>*) = fteip (e: Levi-Civita 

tensor). On the other hand, anti-symmetric product of arbitrary two doublets is 
also STJ{2)l singlet, which means that <p behaves as a doublet. 
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After spontaneous symmetry breaking, moving to the unitary gauge, we have 

(4.138) 

(4.139) 




^> + ff) (o 

and we can see that the 1st and 2nd terms of (4.136) yield the mass matrices of the 
down-type and up-type quarks, respectively. 


Qu^DRj 


{U Li D Li ) 


v -f- H ( 0 

~V 2 


Drj 


V - H - 

-j=D Li D It j + -y=D L iD Rj , 


(4.140) 


Ql^Urj -» ( Uli DLi) 


v + H ( 1 


U Rj 


y/2 VO 

(4.141) 

Here, all fields in the right-hand side are ones redefined in the unitary gauge. 


v - H - 

OLiORj + -y=ULiU Rj . 


4.6.1 Quark mass matrix 

As described above, we can extract the quark mass matrices from Cy of (4.136), 

£(-«) = -Y^DLiM^DRj -YsUnM^URj + h.c. (4.142) 

i,j i,j 


with 


M 


(O) 


i-j 


r ( D )JL 
iJ V? 


M 


(u) _ r (u) 


i 3 


i>3 


y/ 2 ’ 


(4.143) 


which are, in general, complex-valued matrices. 

By applying the bi-unitary transformation to each term of (4.142) as we did in 
the lepton sector, we can diagonalize the mass matrices, and M^ u \ sepa¬ 

rately. 


tf D M {D) V D = M^ D) (4.144) 

U}jM {u) Vu = M (u) (4.145) 

where Ud,u and Vd,u are unitary, and M^ D,U ^ are diagonal matrices. Thus we have 
moved to the base of mass eigenstates: 

D' l = tf D D L , (4.146) 

D'r = V' d D r , (4.147) 


~Pufuc. T/faf/ttMtafina/ 
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U' L 

= U\jU L , 

(4.148) 

U'r 

II 

(4.149) 


where we used the matrix notation with 


Dl = 


( Dl2 \ 


D' l = 


( Si ^ 


V ■ / V • J 


(4.150) 


etc. and the fields with and without prime denote the mass eigenstates and the 
weak eigenstates, respectively. Then, we have 

£(”>q) = -D' l M^D' r - D' r M^D' l - U' l M (u) U' r - U'hM^U'l. (4.151) 

Though MW and are diagonal matrices, they are complex, in general. How¬ 
ever, if we parametrize the i-th diagonal element, using the absolute value m\ D ' U ^ 
and phase a\ D ’ U \ as 

m\ D) = m^ ) e ia ‘ D) , (4.152) 

M\ u) = m\ u) e ia< i U \ (4.153) 


we can absorb all these phase factors into new quark fields induced by a chiral 1/(1) 
transformation, which keeps physics unchanged. 


D'u 

d°\ D) />D' Li , 

(4.154) 

D'm 

e-^ 0) /*D' Ri , 

(4.155) 

U'u 


(4.156) 

u' Ri 

e~ ia i U) ^U' Ri . 

(4.157) 


Then, finally the mass terms read as 

£("“») = -D'm^D' - U'mMu', 
where m!' D ' u ^ are real and diagonal, 


( m d 

0 

° ^ 

/ m u 

0 

° \ 

o 

m s 

0 

, = 0 

m c 

° 

V o 

0 

m b y 

V o 

0 

m t ) 


(4.158) 


(4.159) 


and D' and U‘ are down-type and up-type quarks in the mass eigenstates, respec¬ 
tively. 


D' = 




U’ = 


d 

t' 



(4.160) 
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In (4.159), the matrix element m* represents the quark mass of the i-th flavor. 

In the above discussion on the GWS model, we have implicitly assumed that 
we have only electroweak interaction as the gauge interaction. However, if we take 
strong interactions of quarks into account, must have another piece induced 

by the QCD anomaly of the chiral U (1) current; this is related to the problem called 
“strong CP problem”. (See, for example, the review by Peccei (Peccei, 1989).) 


4.6.2 Flavor mixing 

4.6.2.1 The case of charged current interaction 

Now let us discuss the charged weak current interactions, which can be extracted 
from the first line of (4.135) 

4c = E ULi'fD Li W+ + ^ E DurUuW-, (4.161) 

where the sum of i is taken for all generations. For a while, we let the number of 
generations, which we denote by n, be arbitrary, for generality. In terms of mass 
eigenstates defined in (4.146)-(4.149), we can write this C^c as 

4c = -^U , L U\'fU D D , L W+ + -^& L l4>'fU L U , L W- , (4.162) 

where D' L etc. are column vectors defined in (4.150), though now they are n- 
component vectors. Then, defining a flavor mixing matrix V, which is unitary, 

V = U\jU D , (4.163) 


we can write 


„(Q) _ 

-CC ~ 


JLu' L rVD' L Wi + JL[>' L v<-fUiw; 




7 5 )VD'W+ + h.c. 


(4.164) 


Here we are interested in how many physically meaningful independent parameters 
exist in V. V is a n x n unitary matrix with n 2 real parameters. (Because of the 
unitarity condition of V, there are n 2 relations among 2n 2 parameters of V. Then 
the number of remaining parameters is n 2 .) As is well-known, a real unitary matrix 
is an orthogonal matrix with n(n— 1)/2 parameters. Then, V has n(n — 1)/2 angles 
and n 2 — n(n — 1)/2 = n(n + l)/2 phases. However, some of these phases can be 
absorbed into the new quark fields redefined by phase transformation; for example, 
when we consider 


U' l1 »VD' l 


= U' L1 Y{VnD' L1 + V l2 D' L2 + • • • + V ln D' Ln ) 
+ U' L2 Y{V2iD' L1 + V 22 D' L2 + --- + V 2n D ' Ln ) 
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+ . 

+ U' Ln r(V nl D ' L1 + V n 2 D , L2 + --- + V nn D , Ln ), (4.165) 

the n components of the 1st column (Vn, V 21 , ■ ■ •, V n i ) 1 of the mixing matrix V can 
be made real by writing Vn = \Vn |e ta * and redefining the up-type quark fields as 
U' Li -4 e %a i U' Li ( i = 1,2, • • ■,n). Similarly, we can make the (n — 1) components 
of the 1st row(Vi 2 , V 3 , • • •, Vi n ) real in the redefinition of D' Li -4 e~‘ a ‘ ) D' Li (i = 
2,3, (Note that Vn = | Vn | has already been made real by the redefini¬ 

tion of U' L1 .) After this procedure, (2n - 1) phases are removed from V without 
changing physical consequences. After all, remaining parameters are characterized 
by n(n — l )/2 angles and n(n + l )/2 — ( 2 n — 1 ) = (n — l)(n — 2)/2 phases. 


• Four quark case (n = 2) 


Here the quark mixing matrix V can be parametrized by only one flavor mixing 
angle 0 C , called Cabibbo angle. There is no phase (= 0 for n = 2) 
and hence we have no CP violation in this case because of the absence of complex 
coupling constants. The charged weak current interaction can be written as 


with 


Ccc = fyu 2 ) 7 M ^-y^V ( d g ) W+ + h.c. 


( V ud 

Vus \ / 

cos 9 C 

sin6 c \ 

V v cd 

V cs ) l 

- sin 6 C 

COS0 c ) 


(4.166) 


(4.167) 


Here and hereafter, the mass eigenstates of quarks are simply denoted by u, d, etc., 
suppressing the prime '. 

This means that the left-handed quark doublets are written as 



where 


(4.168) 


( d c \ _ / cos 6 C 

\ s c ) L V -sin0 c 



(4.169) 


and the weak charged current processes are due to the transition u 44 d c or c 44 s c . 


• GIM mechanism 


In 1960s, only one quark doublet was known from the analysis of the jd decay of 
neutron and A which occur via weak charged current transition u 44 d c between 






80 


THE STANDARD MODEL OF ELECTROWEAK INTERACTIONS 


the members of the doublet, 

^ )= ( d c ( d cos 9 C + s sin 9 C ) L ’ (4.170) 

At that time, the orthogonal combination of dL and sl, i.e. s c = s cos 0 C — dsin8 c , 
was left as a SU(2) L singlet. Then, when we apply this to neutral currents, we have 

<9l ) 7m= ^(ul 1 u.u l - cos 2 9 c d L l^d L - sin 2 9 c si^ li s L 

- cosO c sm6 c (d L 'Y fl SL + SLlndL)), (4-171) 

where the final term corresponds to a strangeness changing neutral current. (We can 
check that the part of the neutral current, proportional to the electric charge, does 
not contain such strangeness changing neutral current.) If this term really exists, we 
necessarily predict a rather large decay width of the flavor changing neutral current 
transition such as K £ — » n + fi~ or K + -A n + vv, since flavor changing sdZ and dsZ 
vertices appear already at the classical (tree) level. But this prediction is completely 
in disagreement with experimental results, Br(K° -A- = (7.25 ± 0.16) x 

10~ 9 , or Br(K + -A 7 t + Du) = (1.6^qJ) x 10 _1 °. (See Chapter 9 for some detailed 
discussions on these flavor (strangeness) changing neutral current processes.) 

In 1970, Glashow, Iliopuolos and Maiani(GIM) (Glashow, Iliopuolos and Maiani, 
1970) proposed a new mechanism to solve this problem by introducing the 2nd quark 
doublet which contains the 4-th quark, which is now called the charm quark c, 



c 

s cos 9 C — d sin 9 C 



(4.172) 


which produces additional neutral current, 

= ^( 5 £7m c £ “ cos2 OcS'Yv.sl + sin 2 9 c d L ~i tl d L 

+ cos 9 c sm9 c (dL^SL + «l7m^£))- 


(4.173) 


Summing up (4.171) and (4.173), we find that the neutral current finally becomes 
flavor diagonal; 

Q ( l7^Q ( L +Ql ) 1^Q ( L = -^{UL^UL+CL^CL-dL^dL-SL^SL), (4.174) 

and we have no flavor changing neutral current at all at the tree level. 

Then, how the flavor changing neutral current transitions are possible, even 
though it is extremely suppressed? GIM proposed a wonderful idea that it will occur 
at the quantum level via, for example, the higher order charged current interaction 
shown in Fig. 4.3 (for the K° L -A decay). The contribution of the diagram 


~Pufuc. %af/u>Mia£uraZ 
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sin 6 C 

iWWVr 
W+ 


W~ 
J'WVW 


cos 6 C 


(a) 



cos 8 C 

rvww 

w+ 


W~ 

Wwvl 


— sin 6 C 

(b) 

Fig. 4.3 




(a) to the decay amplitude behave, ignoring the powers of small -Sf- and -jSf-, as 

m w iVl W 

.. _ 4 . g 4 sin 0 c cos 0 c _ . . . 

) oc ———2 -« GFasin0 c cos0 c , (4.175) 

M w 

where a = e 2 /Ait is the fine structure constant. But this contribution, though 
suppressed by the factor a, is still too large to account for the decay width of 
the “rare process”. However, the contribution of the 2nd doublet introduced by 
GIM, as shown in the diagram (b), yields a contribution, — 3 sm .^ c —-, which is 
just opposite to that of (a). Thus the amplitude exactly vanishes at the leading 

m 2 

order of the perturbation in the powers of t rf- (q = u, c), and therefore the non- 

M w 

m 2 

vanishing contribution stems from the 1 st order of yielding a suppression factor 

m w 

2 _ 2 

m %pr l k ~ 3 x 10 -4 . This mechanism of suppressing flavor (strangeness) changing 
neutral current is called GIM mechanism. Hence, the amplitude behaves as 


A( k l -+ ) oc 


g 4 sin 9 C cos 6 C 

Ml 


m 2 - m 2 

M 2 W 


G 2 p sin# c cos0 c (m;5 — m^). (4.176) 


Therefore, if ttIq — ?n^, g, decay is strictly forbidden. Gaillard and Lee 

(Gaillard and Lee, 1974) predicted the mass of the introduced c quark as the value 
to reproduce, via the above formula, the experimental value of Br(K° —> n + fi~), 
before the discovery of the c quark at the collider experiments. The predicted 
value was quite consistent with the value ( m c rs 1.5 GeV) determined from the 
spectroscopy of charmonium. More detailed discussion of these rare processes in the 


Ptihc 7fea.fAcmnLtc.af 
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full six quark (3 generation) Kobayashi-Maskawa model will be given in Chapter 9. 
It is remarkable that the GIM mechanism was proposed before discovery of J/if) in 


1974. 


• Six quark case (n = 3) 


Here V has 3 angles and 1 phase. Because of the existence of 1 phase, we 
can expect the CP violation in this case. The extension of the model to the 3 
generation scheme, in order to accommodate the observed CP violation in Kl decay, 
was first proposed by Kobayashi and Maskawa (Kobayashi and Maskawa, 1973). 
They generalized the Cabibbo mixing matrix, (4.167), to the six quark case of 
3 generations and thus the proposed 3x3 mixing matrix V is called Cabibbo- 
Kobayashi-Maskawa(CKM) matrix. It is a remarkable fact that the proposal by 
Kobayashi and Maskawa was made before the discovery of the c quark in the 2nd 
generation! The GWS model extended to the Kobayashi-Maskawa 3 generation 
model embodies the standard model of electroweak interaction nowadays. In this 
case the charged weak current interaction becomes 


C c c = 77^ uc 1 2* V 


d \ 

s \W+ + h.c. 

b ) 


(4.177) 


There are several pamaretrizations of the CKM matrix V, which are physically 
equivalent. Using 3 flavor (generation) mixing angles, 6 1 , 9 2 , # 3 , and a CP violating 
phase, 5, one example is given a, la Kobayashi and Maskawa by 


V = R 1 {e 2 )R 3 (6 1 )C(0,0,6)R 1 (e 3 ), 
where Ri is a rotation matrix around the axis i 

/ 1 0 0 \ ( Ci si 0 \ 

Ri(0i) = 0 Ci Si J , R 3 {0i) = -Si Ci 0 

\ 0 -Si Ci J V 0 0 1/ 

with Ci — cos 0i and s i — sin 6i and 

/! ° ° \ 

C(0,0,S) =01 0 . 

V 0 0 e iS j 

V is explicitly written as 

( Cl SlC 3 Si s 3 \ 

-sic 2 C 1 C 2 C 3 - s 2 s 3 e ld C 1 C 2 S 3 + s 2 c 3 e lS 
sis 2 -C 1 S 2 C 3 - c 2 s 3 e td -cis 2 s 3 + c 2 c 3 e' 5 J 


(4.178) 


(4.179) 


(4.180) 


(4.181) 


where the values of the 3 angles and the phase are not predictable within the 
framework of GWS model; they must be extracted from experiments. A phase <5 
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cannot be removed by the redefinition of quark fields and leads to the CP violation 
of the processes including It is noted that the phase <5 disappears if any of 6 i, 
6 2 , 0 3 vanishes; If 63 = 0, Ri(9 3 ) = 1. Then, the phase 5 can be absorbed into 


the newly defined fields D' = (7(0,0,<5)77, where D = 


d 

s 

b 


i.e. by “re-phasing” 


of the quark fields. For the case of 0 2 = 0, using the relation f?3(#i)(7(0,0, (5) = 
C( 0 , 0 ,S)R 3 ( 0 x), the phase 6 can be also removed by redefining the new field U' ~ 

u 

l • For 9x = 0, we have V = Ri(6 2 )C(0, 0, J)i2i(0 s ) 


(7(0,0, —<5)1/ with U = 
which leads to 


V = 


1 0 0 \ 

0 a b I , 

0 c d ) 


(4.182) 


where the 2x2 submatrix in V can be made real by suitable re-phasing, just as in 
the case of n = 2 . 

Another parameterization of V is the so-called “standard” parameterization 
which is taken by the Particle Data Group (Particle Data Group, 2002) and is 
characterized in terms of 3 angles # 12 , 0 2 3 , 9 13 and a phase $13 as 

( C 12 C 13 S 12 C 13 si3e~ iSl3 \ 

— S 12 C 23 — Ci2S23Sl3e tl513 C 12 C 23 — Sl2 s 23Sl3e 1<513 S 23 C 13 1 , (4.183) 

S 12 S 23 — Ci2C23Sl3e 81513 —C 12 S 23 — Si 2 C 2 3Si3e tSl3 C 23 C 13 J 

where Cij = cos Oij and Sij = sin 6ij (i,j = 1,2,3). 

A convenient way to approximate V has been proposed by Wolfenstein (Wolfen- 
stein, 1983); it is a parameterization by expanding each element of V in the powers 
of the sine of Cabibbo angle, A = S 12 — 0.22. In the approximation up to the order 
of A 3 , it is written as 

/ 1 - A 2 /2 A A\ z (p — irj) \ 

V = I -A 1 - A 2 /2 AA 2 , (4.184) 

\ AA 3 (1 — p - ir)) —AX 2 1 / 

where A, p and rj are real parameters of the order one, and r; corresponds to the 
CP violating phase. This is often used in phenomenological analysis because this 
parameterization shows clearly the hierarchical structure of the CKM matrix. 


4.6.2.2 The case of neutral current interaction 

Let us next consider the situation of neutral current interactions, 

4$ = eJ e r A>1 + - sin2 (4-185) 

COS t/\y 
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where 

J e , m = \u ltl U- l -D ltl D, 

Jl = ^{UlIhUl - D l ^D l ). (4.186) 

These currents remain flavor diagonal, even in the base of mass eigenstates, because 
of the unitarity of Uu,Ud, Vy and Vd- For example, one can easily see 

U^U = Ul^Ul + Ur^Ur, 

= U' L Ul^UuU' L + U' R Vl^VuU' R , 

= Urr' L + Ur 1»U' r , 

= U'^U'. (4.187) 

Therefore, we have no flavor changing neutral current at the tree level. The kinetic 
terms of quarks are also unchanged when we move to the mass eigenstates. Thus, 
we have no CP violation solely due to the weak neutral current and electromagnetic 
interactions. 

What about Higgs boson couplings to quarks? From (4.133), (4.140) and (4.141), 
one can extract the Higgs-quark interactions in the unitary gauge as 

H - 

C R qq =-( m u uu + mddd + m s ss + rribbb + m c cc + mitt), (4.188) 

which is also flavor diagonal. Then, the Higgs couplings to quarks in the standard 
model with one Higgs doublet do not generate CP violation. It should be noted 
that this is not the case in the models with multi-Higgs, such as the models of spon¬ 
taneous CP violation discussed by Lee and also by Weinberg (Lee 1974; Weinberg 
1976). 

4.7 Anomalies 

Renormalizability is an essential principle for the theory to be meaningful at the 
quantum (loop) level. It is realized by sophisticated cancellation mechanism implied 
by gauge invariance, which is related to the current conservation called Ward- 
Takahashi identities (Ward, 1950; Takahashi, 1957). However, in field theories 
there can be the case that a conservation law holding at the tree level is violated 
at the quantum level by the contributions of loop diagrams. This is known as 
anomalies. A typical example is the triangle anomaly which is offen called the 
Adler-Bell-Jackiw(ABJ) anomaly or the 75 or axial anomaly (Adler, 1969; Bell and 
Jackiw, 1969; Bardeen, 1969). The “gauge anomaly”, the anomaly of local gauge 
symmetry, is originated from the (linear) divergence of a fermion loop diagram of 
triangle shape with 2 vector and 1 axial vector currents at three vertices, as shown 
in Fig. 4.4. If this anomaly remains, the renormalizability of the theory is spoilt 
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Fig. 4.4 


and the model becomes meaningless at the quantum level (Georgi and Glashow, 
1974; Gross and Jackiw, 1972; Bouchiat, Iliopoulos and Meyer, 1972). In general, 
anomaly cancellation is not trivial for the models with parity violation, containing 
both vector and axial vector currents, in contrast to the case of e.g. QED where 
only vector current exists. Thus, it is very interesting to examine if the anomaly 
cancellation works for the GWS model which has both vector and axial vector 
currents. 

Let us consider, in general, the Lagrangian describing an interaction of left- 
handed and right-handed fermions with gauge bosons A 1 , 

C = - igTlA^L + - igT^A^R, (4.189) 

where the summation over i is implied. ipL and i])r are column vectors composed 
of left-handed and right-handed fermions, belonging to some representations of the 
gauge group, and T[ and Tk are the corresponding generators of the gauge group 
G satisfying 

TLriTIr] =ifi jk T£ <R , (4.190) 

for each of and where Ujk is the structure constant of G. Then, the currents 
coupled to gauge bosons are given by 

= i>L^ T ii> L + tPr^T'rxPr, 

= 75)21 V> + ^(1 + 75)7^- (4-191) 

Now it is known that the triangle anomaly associated with the triangle diagram 
with external gauge bosons A 1 , , A k is proportional to 

A ijk = A% k - A % k , (4.192) 

where the contributions of the right-handed and left-handed fermions are given by 

Af L = Tr ({T l R ,L,T^ L }T k <L ) , (4.193) 

and do not depend on the fermion masses. The anti-commutator in the above 
equation comes from the exchange of A 1 and A* gauge bosons, which are assumed 
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to couple to the vector currents at the two vertices. Then, the condition for being 
anomaly free is 

TV ({TA,T£}T*) - TV ({TlTi}Tt) = 0. (4.194) 

For the vector (T£ = T R ) or vectorlike (T[ = U~ i T l R U with a unitary matrix 
U) gauge theories*, the anomaly free condition (4.194) is automatically satisfied. 
Typical examples of the vector theories are QED and QCD, both conserving parity 
symmetry. Another possibility to satisfy (4.194) is the case 

Af = A i l k = 0. (4.195) 

This is realized, for example, for the gauge group of 517(2); 

=0 (4.196) 

due to {t’,T' 7 } = 2 Sij. This property essentially comes from the fact that the 

representation of SU (2) is “real”, and therefore the contributions of a Weyl fermion 
ip and {ip) c , with c standing for the charge conjugation, should be the same, while 
their contributions to the anomaly should have opposite signs, as their chiralities 
are different. 

Then, how about the GWS model? In the GWS model with only one family 
of leptons, matter fields are the left-handed doublet (i^ei, e Z)* an d a right-handed 
singlet e R and the generators for the gauge group are given by 

Tl = j, y ; r R = (4.197) 

It is evident that Tr ({-y, 7r} ?y) = 0 as shown above. Then, for the model to be 
anomaly free we must impose 

Tr ({~ t j}Yl) = \SijT.W L = i«5 ij TV(2Q - r 3 ) = <%TrQ = 0, (4.198) 

and 

TrY^ - Tr 1% = 0. (4.199) 

(Note that another possible anomaly associated with a diagram with currents ac¬ 
companied by single r l and two weak-hypercharge generators Y simply vanishes, 
thanks to the relation Trr 1 =0.) Here for right-handed singlets, Q = Yb. anc l thus 
we have 

TrYfl = 8Tr Q 3 oc Tr Q, (4.200) 

‘Currents are written as jp — '4>LTy.T\ j il>L + $RT^T^ip r — $ lT iiT l h ip l + ■ipii‘y^UTlU~ 1 ipR = 
V' 7 /iT’V’, where ip = ipi + U~ 1 ipR and T l = T' L . 
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since Q 3 = Q for leptons, and for left-handed doublets, we see 
TrYl = Tr(2Q — r 3 ) 3 

= 8TrQ 3 + 6Tr (Q(r 3 ) 2 ) - 6 Tt(Q 2 t 3 ) - Tr ((r 3 ) 3 ) 

oc TrQ, (4.201) 

where we have used relations (t 3 ) 2 = 1, Tr (t 3 ) 3 = 0 and Tr(Q 2 r 3 ) = TrQ for 
leptons. After all, for the GWS model to be anomaly free, it is necessary to have 

TrQ = 0. (4.202) 

This condition is not satisfied in the GWS model only with leptons and should be 
satisfied by extending the model so that it includes quarks with 3 color degree of 
freedom. For example, for the 1st generation model of leptons and quarks with N c 
color degree of freedom 

5 

L 

the condition of anomaly cancellation 

TVQ = 0 - 1 + N c q - i) - 0 (4.204) 

can be satisfied only for N c = 3, where 0, —1, | and — | represent the charges 
of u e , e, u and d, respectively. (Though the statement that (4.199) is equivalent 
to (4.202) was made for the pure leptonic model, it turns out to hold even after 
the inclusion of the quarks.) The lepton anomaly can be cancelled by the quark 
anomaly only when the color degree of freedom is 3. Therefore, the standard model 
with G — SU(3) C x SU(2)l x U(1)y suggests that leptons and quarks are closely 
related to each other. The real unification of quarks and leptons in the multiplets 
of gauge group is realized in the “Grand Unified Theories (GUT)” to unify strong 
and electroweak interactions, such as SU (5) or 50(10) gauge theories. 



,UR,dR, 


(4.203) 


Problems 

4.1 Prove the commutation relations [Q - T 3 , T l ] = 0 for i = 1,2,3, where Q is the 
electric charge operator defined by (4.18) and T l are weak isospin operators defined 
by (4.13). 

4.2 Show that the Yukawa interaction defined by (4.33) is invariant under SU{2)l x 
U (l)y gauge transformation. 

4.3 Show that in the GWS model, the electric charge operator Q is not a broken 
generator, though weak isospin operators T 1 violate the symmetry of the vacuum 
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given by (4.36). 

4.4 Derive (4.68). 

4.5 Show that arbitrary nxn complex matrices can be diagonalized by bi-unitary 
tarnsformations. 


4.6 The lowest order decay amplitude of W -* i&t (£ = e, //, r) is given by 

M = 2 ~ A), 

where e /i (fc, A) is the polarization vector of the W boson with helicity A. Then, 
neglecting the lepton masses, calculate the decay width for this decay which is 
given as 


r(W -> Ivi) = 


G F M*y 


4.7 The interaction between Z° and a fermion / is given by 


r z _ 

NC ~ 


_ 2 . _ j z zm 

cos o w 11 ’ 


where f is given by (4.97). 

(l)Neglecting the fermion mass, calculate the decay width of Z° -y ff in the tree 
level, which is given as 

r (Z° -y ff) = ((4) 2 + (C') 2 ) , 


where Cy, C A are given in (4.98)~(4.101). 

(2) Since the top quark is heavier than Z °, Z° cannot decay into a top quark pair. 
Then, assuming sin 2 9w — 0.22, estimate the branching ratio of Z° -y e + e“. 
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Quantum chromodynamics (QCD) is the theory of strong interactions which de¬ 
scribes the dynamics of quarks and gluons. QCD is the non-Abelian gauge theory 
with SU (3) color symmetry. In 1930s, the idea of strong interactions was first in¬ 
troduced by Yukawa to explain new strong forces those days, called nuclear forces, 
between nucleons mediated by pions. Before long, many hadrons, in addition to nu¬ 
cleons and pions, were discovered and it was revealed that the interactions among 
those hadrons were very complicated and did not seem to be fundamental. 

Now, we know that all hadrons are composite particles of quarks which were 
first introduced by Gell-Mann and Zweig in 1964 (Gell-Mann, 1964; Zweig, 1964) 
to explain the spectroscopy of hadrons those days. The quarks were established 
as the fundamental constituents of hadrons in the development of the quark model 
in 1960s and 1970s; baryons are composed of three quarks qqq and mesons of a 
quark q and an antiquark q. The quarks are bound inside hadrons through strong 
interactions mediated by gluons with the coupling of color charges g s \-, where 
A 1 (i = 1,2, •••,8) are the 3x3 Gell-Mann matrices as explicitly presented in 
Appendix (C.4). The situation in hadrons is quite similar to the case of positronium 
where the electromagnetic Coulomb forces between a positron and an electron are 
mediated by photons with the coupling of electric charge e. It is well-known that 
interactions between electrons (or positrons) and photons are beautifully bescribed 
by quantum elecrodynamics(QED), i.e. the gauge theory of 17(1) symmetry called 
U(l)em, which is invariant under local Abelian [/(1) phase transformation. QCD 
which describes the strong interactions between quarks and gluons is also the gauge 
theory but this time it is not an Abelian theory but a non-Abelian theory of SU (3) 
color symmetry, described as SU (3) c , which is invariant under local non-Abelian 
SU( 3) transformation in color space. Notice that the color being in 3 species as 
7?.(red), 77(blue) and G(green), is unrelated to another attribute of quarks, i.e. the 
flavor, which is in 6 species (u , d, s, c, b, t ). In contrast to the flavor symmetry 
like the SU (3) / symmetry among u. d and s quarks which is rather largely broken, 
the color SU( 3) c symmetry is unbroken and exact as well as the U(l) em symmetry 
in QED. Therefore, just like photons in QED, gluons as gauge bosons mediating 
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strong interactions are also massless. 

Since there exist no colored hadrons in Nature, we assume that all observed 
hadrons must be colorless, i.e. in the color singlet states. This seems to be quite sim¬ 
ilar to the case of isospin states for two-nucleon systems. As is well known, though 
we can expect, at a glance, to have two different isospin states I — 1 (isotriplet) and 
I = O(isosinglet) for the two-nucleon systems composed of a proton and a neutron, 
actually only 7 = 0 state is allowed to exist as a deuteron. However, one should 
notice that there is a big difference between hadrons and deuteron; though in the 
case of deuteron, a proton and a neutron which are constituents of deuteron exist 
separately in Nature, colored quarks which are constituents of hadrons do not exist 
separately in Nature but are confined inside hadrons as colorless states. This is the 
so-called confinement problem which should be explained in QCD. 

In this chapter, after describing the evidence of the color degree of freedom, we 
discuss the coupling strength of color forces between two quarks and also between a 
quark and an antiquark. Then, we discuss the running coupling constants in QED 
and QCD cases and discuss the difference between QED and QCD. 


5.1 Evidence for colors 

There are several evidences for existence of the color degree of freedom, which is 
actually in 3 species. The color was first introduced to solve the difficulty in the 
relation of spin and statistics in the baryon spectroscopy. In the quark model, 
baryons are made of three spin | quarks. Without color space, the wavefunction 
of a baryon is described by the product of space-, spin- and flavor-wavefunctions. 
Then, consider the A ++ (1232) with spin |. It is the ground state of the uuu 
bound system and hence its wavefunction is totally symmeric under interchange 
of any u quark pairs in space-, spin- and flavor-wavefunctions. However, since the 
A ++ (1232) is a fermion with spin |, its total wavefunction must be antisymmetric 
under interchange of any u quark pairs. This difficulty can be solved by introducing 
an antisymmetric color space wavefunction, where the color space is a new internal 
space. 

A more direct evidence for color to be in 3 species comes from the experimental 
data on e + e“ annihilations at high energies. Based on the quark parton model, the 
ratio 


cr(e + e —> hadrons ) 
cr(e+e- -A M+/0 


(5.1) 


is predicted to be R = e q> where e q is the electric charge of the quark q in quark 
pairs produced in the e + e _ annihilation. Beyond the s quark pair production 
threshold but for not so high energies, only u, d and s quarks contribute to this 
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Fig. 5.1 


ratio R and yield 

2 

R = e 2 u + e 2 d + e 2 = -, (without color) 
o 

R = 2. (with 3 colors) (5.2) 


Similarly, for higher energies beyond the c quark pair production threshold, R be¬ 
comes 

R = e 2 u + e 2 d + el + ^, (without color) 

R = —, (with 3 colors) (5.3) 

O 

and for more higher energies beyond the b quark pair production, it becomes 

R = el + e% + e 2 -\-e 2 +e$ = ^-, (without color) 

R = (with 3 colors) (5.4) 

o 

Experimental data prefer the 3 colors for any cases. 

Another important observation suggesting the color number to be 3 is the decay 
rate of ir° —> 77 . The process proceeds through the coupling of the pion to a quark 
loop as shown in Fig. 5.1, in which the u and d quark loops contribute to this decay 
kinematically. The calculated result is given by 

= "?(«S-4)’(f)’ jgjr. <«) 

where N c is the number of colors and f n ~ 130MeV is the pion decay constant. 
Then, we can predict the rate to be 


r(7T° -> 



0.86 eV, (without color ( N c = 1)) 
7.65 eV. (with 3 colors ( N c = 3)) 


(5.6) 


The measured rate is (7.74 ± 0.55) eV which agrees with N c = 3. 

Further example is the branching ratio of the r lepton, Br = . A 

t~ lepton decays into kinematically allowed lepton pairs, (e~ P e ), (/j,~ P m ), and 
quark pairs, (d u), (s u), through the diagram shown in Fig. 5.2. Among these 
channels, the contribution of (s u) is very small because it is a Cabibbo-suppressed 
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S 


u 


\ 


process and hence can be neglected. Since the coupling is same for all these channels, 
we can simply predict the above branching ratio as 

Br = i (without color) 

O 

Br = 7 . (with 3 colors) (5.7) 

5 

The experimental data is Br = (17.84 ±0.06)% which is again in agreement with 3 
colors. 


5.2 QCD Lagrangian and the strength of color farces 

The QCD Lagrangian describing the interaction between quarks q and gluons Aj \ 
(i = 1,2, • • •, 8) is given by 

C = q{ilp-m)q--fi v F i ^ (5.8) 

where the summation over i ( i = 1,2,•••,8 ) is implied. The quark field q is 
given in both the Dirac space and the color space with three color components 
( q R \ 

as q — I q G , where the superscripts, R, G and B, denote red, green and brue, 

\ q B ) 

respectively. The covariant derivative is given by 

D fl = d li -ig s ^A l tl , (5.9) 

where g s is the strong coupling constant and A* are the 3x3 Gell-Mann matrices 
presented in Appendix (C.4). The summation over i (i — 1,2, • • •, 8) is implied. 
are the field strength tensor for gluon fields A % and given by 

= a M At - a„A; + s ./ ljfc AjA 5 , (5.10) 

where fijk are the structure constant of the color SU (3) c group. 
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Here we are interested in the strength of color interactions between two quarks 
and also between a quark and an antiquark. Using the explicit expression of A* 
presented in (C.4), the interaction between quarks and gluons given in (5.8) is 
written as 

\i 

£-int — OsQ'Yfi 

= J ^{q R rq G Gl + q G rq R Gl 
+ -^=(?V«* - m 9°)G£ + q R rq B Gl + q B ^q R G\ 

+ f^q B Gl + q B ^q G Gl 

+ -^(^7V + ^ G -2g B 7V)G«}, (5.11) 

where 

Gl = (Al-iAD/V 2, Gl = (Al + iAl)/V2, G® = A* 

Gft = (A* - iA® )/v^, G® = (Aj + iA*)/y/2i (5.12) 

Gl = (Al-iAl)/V2, Gl = (Al+iAl)/V2, G® = A® 

represent redefined gluon fields. As can been seen here, the gluons come in 8 different 
color combinations, EG, GE, ^(EE - GG), RB, BR, GB, BG, ^(EE + GG - 
2EE). This means that the gluons belong to a color SU(3) C octet. For example, 
Gj, changes the color of the quark from G(green) to E(red). Another combination, 
i.e. the color singlet ~^(EE + GG + BB ), does not mediate color charge. 

Now we are concerned with the interactions between two quarks. First let us 
consider the interaction between E(red) and E(red) which arises from 2 diagrams 
due to G^ and G® exchange as shown in Fig. 5.3(a) and (b). Then, the coupling of 
this interaction is obtained from (5.11) as 



The interaction between B (green) and B (green) arises only from G® exchange as 
shown in Fig. 5.3(c) and the coupling becomes 


(-2-h.Y = 

\ V 6 V 2 J 3 VV 2 ) 


(5.14) 


The interaction between E(red) and G(green) arises from 3 diagrams shown in Fig. 
5.3(d), (e) and (f) and the coupling becomes 




2 

3 



(5.15) 
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(e) (f) 


Fig. 5.3 

As seen in the above examples, all color interactions between two quarks (color- 
color interactions) are equal as expected from exact color symmetry. In other words, 
two quark interactions are blind to the difference of colors. 

What about the interactions between a quark and an antiquark (color-anti-color 
interactions) in the color-singlet state, 4 =(RR + GG + BB), i.e. between q and q 
in a meson? Here, since each color is in equal weight in the color-singlet state of a 
meson, it is enough to consider, for example, the case of H(blue) and H (anti-blue) 
whose interaction arises from 3 diagrams shown in Fig. 5.4(a), (b) and (c). (The 
situation is same for between f?(red) and i?(red) and also between G'(green) and 
G'(green).) Then, the coupling of this interaction becomes 



(5.16) 


Here we should notice that the coupling between antiquark and gluon is in the 
opposite sign to the one between quark and gluon because of vector nature of gluons, 
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9 ig s i 9 


Fig. 5.5 

just as in QED mediated by a photon where the charge of positron (anti-particle 
of electron) is opposite to the electron charge. We have the same contribution of 
RR and GG as the one of BB obtained in (5.16). However, when we consider the 
color-singlet state of mesons, each of initial and final states in Fig. 5.4 has a factor 

■^j. Then, taking into account of these factors as 3 x = 1, the coupling for 

qq interactions in a meson becomes the same as given in (5.16). 

The similarity of QCD and QED is in many cases useful in calculating processes 
in the tree approximation. Let us take one such example, a process, q q + g (Fig. 
5.5). The matrix element of this process is given as 

M = n a Ai, (5.17) 

which is essentially same for the QED process, q —> q + 7 except for the coupling 
constant and the color matrix factor, a and b denote the color indices. Squaring 
the above matrix element, one can extract the color-dependent factor as 

t^Tt [yyj = |<7*, (5.18) 
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Fig. 5.6 


using the relation Tr = 4, where the summation over i (i — 1,2, • • • ,8) is 

implied. In (5.18), a factor | is due to averaging over the initial quark’s color states 
(=3). Then, the transformation from QED process to QCD process can be done 
simply by replacing a by |a s in the transition probability for the corresponding 
QED process. 


5.3 Running coupling constants 
5.3.1 Renormalized charge in QED 

In QED, an electron shows many guises; an electron virtually emits photons which 
produce electron-positron pairs and those pairs emit further photons and so on. 
Hence, an electron turns out to be surrounded by many virtual electrons and 
positrons and due to the Coulomb attractive force, positons become closer to the 
original elecrton (which we call the “bare” electron) and thus the vacuum is polar¬ 
ized as shown in Fig. 5.6. Because of this vacuum polarization, the effective charge 

(observed charge) of an electron decreases as we go away from the bare electron, 

2 

that is to say, the observed charge e or the fine-structure constant, a = |^, is not 
a constant but depends on the distance r from the bare charge or the momentum 
transfer Q 2 in scattering by a test charge. In fact, the value of a(Q 2 ) decreases 
with decreasing Q 2 (or increasing r) and thus the vacuum polarization leads to the 
charge screening as shown in Fig. 5.7. 
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large Q 2 •*- -► small Q 2 

(small r) (large r) 


Fig. 5.7 


e e 



To understnad this property in QED, let us consider the electron-electron scat¬ 
tering, e~(fci) + e~(fc 2 ) -> e“(fc() + e~(k' 2 ). The lowest order amplitude, being in 
0(a), for this process is given by (see Fig. 5.8) 

—iM = [ieu(k[)j tl u(ki)\—^^-[ieu(k! i )'y u u(k2)\, (5.19) 

where q = k\ — k[ is the vertual photon momentum. However, to get an exact am¬ 
plitude for this scattering, we need to calculate higher order corrections containing 
all order of a. The first order correction to the above amplitude comes from the 
diagram of Fig. 5.9, in which the photon propagator contains an electron loop, and 
according to the standard Feynman rule, it is written as 

-iM = (~l)[feu(fc() 7 p u(fci)] 
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e e 



-ig\v 


■jd'p 


Tr 


i^+m) A Hi-t + m) 


m* 


(p - q) 2 




ieu(k' 2 )Yu(k 2 )}, 


(5.20) 


where p is the loop momentum of the internal fermion(electron) and m denotes the 
electron mass. The factor (-1) is originated from the fact that this diagram con¬ 
tains one fermion loop. It should be noticed that the internal momentum p should 
be integrated from 0 to oo and leads to the divergence of the amplitude. (Exact 
calculation results in only a logarithmic divergence, though one might expect it 
to be quadratic because of the apparent form / 0 °°pdp of the integral in (5.20).) 
This divergence can be removed in QED by the renormalization technique as dis¬ 
cussed below. Note that though the amplitude (5.20) is of the 0(a 2 ), this one-loop 
correction to the photon propagator is of the 0(a). 

Now, one might worry about additional contributions of the same 0(a) origi¬ 
nated from other diagrams depicted in Fig. 5.10, where (a) and (b) are the self¬ 
energy terms of electrons and (c) is the vertex correction. However, to our surprise, 
QED resuts in exact cancellation of the sum of the contribution of (a) and (b) and 
the one of (c). Actually this cancellation occurs in every order of perturbation. 
This is the fundamental property of QED and known as the Ward-Takahashi iden¬ 
tity. Because of this cancellation, it suffices to evaluate only the modification to the 
photon propagator. 

The addition of (5.20) to (5.19) can be considered as modification of the photon 
propagator as 


ig^y ig»p j P \ ig\v 
q 2 q 2 + q 2 q 2 
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Fig. 5.10 



2 a dp 2 2a f 1 . 

I(q ) = T- / —j- — dxx{ 1 - *) log 

■air y m 2 7r y 0 


9 2 (1 -x) 


The dots in (5.23) contain terms proportional to q^q v and vanish when the prop¬ 
agator is coupled to external electron currents. Substituting (5.23) in (5.21), we 
obtain 


—^(1 ~I(q 2 ))- (5-25) 

r r 

Therefore, we find that in the 0(a) the modification of the photon propagator 
can be made just by mutiplying a factor (1 — I{q 2 ))- Explicit derivations of these 
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formulas are beyond this textbook and can be found, for example, in the books 
by Bjorken and Drell (Bjorken and Drell, 1964) or Jauch and Rohrlich (Jauch and 
Rohrlich,1976). Here, we just comment on some important points of results. 7(q 2 ) is 
infinite as mentioned above. But the infinity arises only from the first term of (5.24) 
which is logarithmically divergent and independent of q 2 . One can estimate this 
integral by introducing the cut-off parameter A for the upper limit of the integral 
and then by approaching it to oo. Actual calculation will be done for the two 
extremes, i.e. (-q 2 ) m 2 and (—q 2 ) <C m 2 . 

Let us first consider the case of (— q 2 ) m 2 . In this case, log ^1 - j ~ 

l°g(^r) an d then, we can evaluate J(q 2 ) as 



(5.26) 


Thus, for large (—q 2 ), the photon propagator is changed only by mutiplying a factor 
(1 - ^ log(^a)). The multiplicative factor from loop corrections of all orders in a 
can be also evaluated as 





(5.27) 


and thus, the full amplitude becomes 


-iM = [ieu(fci)7'*w(*i)l— . ] [ieu(k! 1 )'f /l u(k 2 )], (5.28) 

’ V 1 + fe 1 °s(4')/ 

which suggests that the effective charge or fine-structure constant a e f / is given in 
terms of a = ^ by 


Oteff 



(5.29) 


2 

So far, we considered the value of a = to be a constant, which is the coupling at 
the vertex of a “bare” electron and photon. We call it as the bare or unrenormalized 
charge/coupling and write it as ao hereafter. On the other hand, the observed 
coupling, usually called the fine-structure constant (« -Ay), is different from this 
and called the renormalized charge/coupling, which is, in practice, given by a e // 
in (5.29) and we write it as a(Q 2 ) hereafter, where Q 2 = —q 2 . Then, we rewrite 
(5.29) as 


a(Q 2 ) 


ol o 


(5.30) 
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where a(Q 2 ) and a o represent the renormalized and bare charge/coupling, respec¬ 
tively. 

Now, suppose that we can get the result, a(p 2 ) = in experiment at some 
value of Q 2 = p 2 . Then, using the formula (5.30) for a(p 2 ), one can obtain 


a(Q 2 ) 



(5.31) 


Notice that in (5.31) there is no dependence on the cut-off A and a 0 • a (Q 2 ) depends 
only on the finite measurable quanitities. Since a(Q 2 ) depends on Q 2 , a(Q 2 ) is 
called the “running coupling constant”. (5.31) shows the charge screening, that is 
to say, a(Q 2 ) decreases (increases) with decreasing (increasing) Q 2 . (See Fig. 5.7) 
Here we introduced a parameter p with the dimension of mass to avoid the infinity of 
the amplitude by renormalizing the charge, p is called the renormarization mass(or 
scale). Different choice of p corresponds to different renormalization schemes but 
final result does not depend on the choice of p. This is because the dependence of 
the amplitude on A is absorbed into a(p 2 ). 

Now, let us move to the oppsite extreme of small {—q 2 ). This limit is relevant 
in the Coulomb scattering of an electron by a static nucleus terget with charge Ze. 
In the limit (— q 2 ) -4 0, we see log(l - q 2 x{l - x)/m 2 ) ~ -q 2 x(l - x)/m 2 and thus, 
(5.24) reduces to 


Itf) = 



r 

157T m 2 


(5.32) 


where we again introduce the cut-off parameter A for upper limit of the first term 
of the integral (5.24). Note that the divergent term, i.e. the first term of this result 
is of the same form as (5.26), just by replacing m 2 by —q 2 . 

Then, to order a 2 , the matrix element for the Coulomb scattering of an electron 
by a charge Ze is calculated by replacing the factor [ieu(k' 2 )~f u u(& 2 )] in (5.20) by 
-ij v with j v = (p = Ze, j = 0) 


-iM = [ieu(fc()7 M u(A:i)]- 


V 




A 2 


nr 


2 i 


a q 
157T m 2 


= [ieu(k[)jou(k \)] —jr- 


When we write this result as 


'-■s 106 


A 2 \ a q 2 
m 2 / 157T m 2 


(-if) 

(-iZe ). (5.33) 


with 


-iM = [ie R u(k[)^ou(ki))]-^(-iZeR), 

q 


(5.34) 




(5.35) 
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Fig. 5.11 




Fig. 5.12 


we can easily check that to 0(a 2 ), (5.35) and (5.34) are equal, en is interptered as 
the measured or renormalized charge = a = . Here the infinity of the matrix 

element coming from the cut-off A —i oo is again absorbed into the observed charge 
eR by renormalizing the bare charge/coupling. 

5.3.2 Running coupling constant in QCD 

As shown in (5.8), gluons have self-interactions because of the non-Abelian nature of 

QCD. This nature produces a drastically different behavior of the running coupling 
2 

constant a s (Q 2 ) = from the one in QED discussed above. The basic quark-gluon 
interaction is given in Fig. 5.11. Concerning the one-loop corrections, there are two 
diagrams as shown in Fig. 5.12(a) and (b). (a) shows the correction from a quark- 
loop which contributes equally to every quark flavor because of flavor independence, 
(b) shows the gluon-loop correction originated from the self-interaction of gluons. 
Apart from the color factor, the contribution from the quark-loop diagram (a) is 
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essentially the same as the one in QED; it suffices to replace the factor a ^ ^ by 
a "^ for each flavor by taking into account of the color factor. The crucial difference 
between QCD and QED arises from the gluon-loop diagram (b). This diagram give 
rise to another numerical factor, a s (fj?), whose sign is opposite to the one from 
the quark-loop. Combining the contribution from both the quark-loop (a) with rif 
flavors and the gluon-loop (b), one can obtain the QCD running coupling constant 
a s(Q 2 ) just by replacing the factor in QED by a ’^ ) (§n/ - 11) as 


a s (Q 2 ) 


_ a s(p 2 ) _ 

1 + ( „-^.( f :i , og (^)- 


(5.36) 


As described before, we know that there are n/ = 6 flavors in Nature. Putting 
nf — 6 into (5.36), we see that the sign of the 2nd term in the denominator of 
(5.36) is opposite to the QED case in (5.31). Therefore, the running coupling 
constant a s (Q 2 ) shows antiscreening; ct s (Q 2 ) decreases with increasing Q 2 as known 
as “asymptotic freedom”. In contrast, for small Q 2 , a s (Q 2 ) becomes large. In fact, 
the demominator in (5.36) becomes 0 at some value of Q 2 — A q CD where 


A %„n = u?e (33 - 2n /)“ s (" 2 ). 


1 qcd — A 1 e 
By using this A qcd, we can write a s (Q 2 ) as 


a s (Q 2 ) = 


127r 


(33 - 2nj) log 

iV QCD 


(5.37) 


(5.38) 


Here we introduced a free parameter A qcd with mass dimention by removing 
the renormalization mass parameter ji. We cannot determine the value of A qcd 
theoretically in QCD. It is extracted from experiment and was determined to be 
A qcd — 200MeV for Q 2 ~ lOOGeV 2 . With rif = 5 flavors ( u , d, s, c, b) taking 
part in scattering processes, the value of a s becomes a s ~ 0.2 which justifies the 
perturbatuve calculation of QCD. The behavior of a s (Q 2 ) is depicted in Fig. 5.13. 


In summary, for large values of Q 2 much larger than A q CD ’ the effective cou¬ 
plings between quarks and gluons becomes small and thus, the strong interaction 
can be treated perturbatively. In this region, quarks and gluons behave as free 
particles (“asymptotic freedom”). In fact, this is the region of the deep inelastic 
scattering scattering where the parton picture works well. On the other hand, for 
small Q 2 region like Q 2 « A 2 q CD -> the quark-gluon coupling becomes large and the 
perturbative calculation cannot be justified. Because of the large coupling con¬ 
stant, all quarks and gluons are confined in hadrons (“confinement”). A qcd is 
the scale which separates the world of confinement scale(hadrons) and asymptotic 
freedom (free quarks and gluons). QCD is the color SU(Z ) gauge theory of strong 
interactions and is the fundamental field theory describing the dynamics of quarks 
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Fig. 5.13 


and gluons, though the whole understandings on the nonperturbative confinement 
problem still incomplete. 


Problems 

5.1 r - can decay into pairs of (e~ 9 e ), {p,~ i/ M ), (d u), and (s u); among these decay 

modes, the contribution of (s it) is very small because it is a Cabibbo-suppressed 
process and can be neglected. Neglecting the masses of all decay particles, estimate 
the branching ratio, Br = \”ii) r ^ > to be \ as given in (5.7). 

5.2 In the tree level, the color-dependent coupling factor for the process q -> q + g, 
where q and g represent a quark and a gluon, respectively, is given by (5.18). Then, 
estimate the factor for g —l q + q. 

5.3 Consulting with other textbooks, for example, the one by Bjorken and Drell 
(Bjorken and Drell, 1964), derive (5.24). 
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Chapter 6 

NEUTRINO MASSES AND NEUTRINO 
OSCILLATIONS 


6.1 Type of Fermions and Fermion Masses 

When P.A.M. Dirac first invented the spinor to describe the electron, it was a 4- 
component complex vector. Fermions described by such 4-component spinors are 
called Dirac fermions. Though the electron has a finite mass, it may be worthwhile 
considering a massless fermion. In this case a new feature arises: a state with h = 1 
(h being the helicity of the particle) and a state with h = -1 never mix with each 
other during the propagation of the fermion. This can be understood intuitively by 
considering a inertia frame, observing the fermion co-moving toward the direction of 
the fermion’s momentum. If the helicity h of the fermion is 1 in that frame, in any 
frame the helicity should be the same, just because the massless fermion is moving 
with the light velocity and any observer never can pass through it to invert the 
direction of momentum and therefore the helicity of the fermion. It is well-known 
that for massless fermion, the eigenstates of helicity and chiral fermions with definite 
eigenvalues of 75, i.e. Weyl fermions ipu and ipL, are identical. We easily know that 
each of chiral fermion forms an irreducible representation of Lorentz transformation 
SL(2,C), as [ 75 , = 0 (E Ml ' = are 6 generators of 517(2, C)). In such 

a sense, Weyl fermions are more fundamental fermions than Dirac fermions. Let 
us recall that fermions to start with in the Standard Model are Weyl fermions, 
essentially because the weak interaction maximally breaks parity symmetry. 

The statement above becomes more explicit if we use the chiral representation 
of 7 matrices, where 75 is a diagonal matrix (see also Appendix A): 


7 m 

= ( 0 ^ ^ 

\ 0 ) ’ 

( 6 . 1 ) 


= (-■!>»), 

( 6 . 2 ) 

a" 

= (-/,-crj), 

(6.3) 

75 

= *7 0 7 1 7 2 7 3 = ( 0 -I ) ’ 

(6.4) 
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where I is a 2 x 2 unit matrix and <Ji (i = 1,2,3) are Pauli matrices. In the 
chiral representation a right- and a left-handed Weyl fermions are described by 
2 -conponent complex spinors i) a and f“: 


TpR 


Va 

0 


->Pl = ( ja I , (a,a = 1,2). 


The Lorentz generators have, as we expect, a form of block diagonal 

^ = if ^ 0 V 

2 V 0 a» v ) ’ 


- a^a”, cP“'=ct'V', ( m ^ v). 


(6.5) 

( 6 . 6 ) 


(6.7) 

( 6 . 8 ) 


For a massive fermion moving with a speed lower than the speed of light, the 
helicity may change depending on the velocity of the inertia frame, observing the 
fermion: the chirality is no longer preserved for massive fermions. Accordingly the 
fermion mass term connects Weyl fermions with different chiralities: 


m(ip R ip L +'ipL' t l ) R)- 


(6.9) 


Thus the fermion suitable to describe the propagation of the massive fermion is the 
combination of these two Weyl fermions, i.e. a Dirac fermion tpD, a full 4-component 
complex spinor: 

IpD = IpR + V’L = ^ ^ > (6-10) 

rl>R = RtPd, ipL = L^d, (i?= L= i-^), (6.11) 

The free Lagrangian for the Weyl fermions is now rewritten in terms of a Dirac 
spinor ipjy. 

ip R i@ipR + tp L i<?ipL ~ m(tp R ipL + iPl^r) = ~ m)^. (6.12) 

Namely, ip R and ipi are “chiral-partners” to form a full 4-component Dirac spinor. 
Actually the chiral partners need not to be two independent Weyl spinors. We 
should note that the action of “charge conjugation” for Weyl fermions changes 
their chiralities; e.g. for ( ip R ) c = 

R(YrY = 0, L(V*) C = (V*) c . (6.13) 


i.e. ( tp R ) c = ( iP c )l• Or, in terms of 2-component notation of Weyl fermions, 


Yr = 


-> (YrY = 


o 

t 


(6.14) 
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where (r) a )* - fj & , rj 61 = e^rjp, e 12 = -e 21 = 1. Thus chiral partner of a Weyl 
spinor can be the charge conjugation of the Weyl fermion itself. Combining these 
chiral partners the following full 4-component spinors are possible to be formed: 

V-M 1 = IpR + Wr) c = ( ’Ll ) , (6.15) 

Ym2 = 4>l + (4>l) c = ^ ^ • (6.16) 

By construction these 4-component spinors ipMi,M 2 are self-conjugate under the 
charge conjugation, (V’mi,M 2 ) c = ipMi,M 2 , and are called Majorana spinors. Though 
Majorana spinors are 4-component spinors, the number of independent complex de¬ 
grees of freedom is 2, i.e. the same as that of a Weyl spinor. Thus as far as kinetic 
term is concerned, where there is no mixing between chiral partners, there is no dif¬ 
ference between Weyl and Majorana fermions and a Dirac fermion is equivalently 
describable in terms of either 2 Weyl or 2 Majorana spinors: 

+ ^M 2 *#M 2 ). (6.17) 

In other words, for massless fermions, the propagators are just the sum of those for 
independent Weyl fermions. 

Once fermions get masses, the type of fermion has real physical meaning. Clearly 
Weyl fermion is irrelevant in this case, since the mass term connects the chiral 
partners. What determines the type of the full 4-component spinor, i.e. Dirac or 
Majorana, is the choice of chiral-partners or in other words, the type of fermion 
mass term. In the case of ordinary mass term for Dirac fermion (Dirac type mass 
term), 


-moipD^D = -mDi^L^R + h.c.) = m D (£ a ria + h.c.), (6.18) 

the chiral-partners are two independent Weyl fermions £ a and rj a . We, however, 
need not to take independent 2 -componet spinors £ and rj as chiral partners and 
they can be identical. Namely if we take as chiral partners the charge conjugation 
of each Weyl fermion the following mass terms are also possible, 

"rn R {{ip R ) c i})R + h.c.) = ^m R (r) a T] a + h.c.), (6.19) 

-^rn L ({ip L ) c ip L + h.c.) = imz,(£“£ a + h.c.). (6.20) 

These mass terms clearly violate fermion number (lepton number for, e.g., neutri¬ 
nos) and are called Majorana type mass terms, while we can assign a conserved 
fermion number for the case of Dirac mass term. If we adopt the Majorana type 
mass terms listed above, the fermions suitable to describe the free Lagrangian should 
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be Majorana fermions ipMi and 4>M2'- 

4> R iqhpR - | m R ((t/j R ) c ip R + h.c.) 
ip L i<?ipL - + h.c.) 


\^mi - m R )ipMi , ( 6 . 21 ) 

\i>M2{ i Q - ™l)^M2- ( 6 . 22 ) 


6.2 Neutrino Masses 


6.2.1 Possible Types of Neutrino Masses 

As we have seen in the previous section, the Majorana mass terms violate fermion 
number. Actually the mass terms violate all possible ?7(1) global symmetries the 
Weyl fermions may have, 

rl> R -* e^R, if ,l -> (6.23) 


(j>Xj and (p R being real transformation parameters. Thus the Majorana masses are not 
allowed for charged particles like electron, since if it is the case U{ l) em symmetry 
is violated, i.e. the charge conservation breaks down. The Majorana mass term is, 
therefore, allowed only for electrically neutral particles, such as neutrinos or neutral 
gauge fermion in supersymmetric theory, such as photino. In this chapter we focus 
on the case of neutrinos. 

Neutrinos of course may have ordinary Dirac masses, in addition to the possible 
Majorana masses. Let v R and vi be right- and left-handed neutrinos. It is inter¬ 
esting to note that even if we consider the most general mass term for v R and vl 
including the Dirac mass term, 

C m = m R (v R ) c v R - ^rn L {yL) c VL - m D V R v L + h.c., (6.24) 

the free Lagrangian for the neutrino still can be written in terms of two Majorana 
neutrinos, as we now confirm. The Dirac mass term mo in C m causes a mixing 
between u R and vl- We thus have to “diagonalize” the whole mass term. The mass 
term can be neatly written by use of matrices as 


C 


m — 


1 

2 



u R 


ttil rriD 
m D m* R 


vl 

(vr) c 


+ h.c., 


(6.25) 


where the property v R v L = (i 'l) c (vr) c has been used. Thus, the mass term, as the 
whole, can be written in the form of Majorana type masse term, even though there 
is Dirac mass term as well. This is based on the observation that ul and (v R ) c 
are both left-handed Weyl fermions and have no essential difference, though they 
have opposite lepton numbers, L = 1 and L — — 1. Thus every mass term should 
be generally written in the form of Majorana mass term, — (V'il) c V’ 2 L + h.c.] in the 
case tpiL = 4>2L the mass term is “genuine” Majorana mass term, while in the case 
of Dirac mass term ipiL Y We also find that the 2x2 mass matrix above 
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is symmetric co mplex m atrix, not a hermitian matrix, in general. This property is 
due to a relation ( ViL) c VjL — {yjL,) c ViL and will be valid for more general cases with 
arbitrary numbers of generations. Though such symmetric complex matrices are 
known to be diagonalized by use of a unitary matrix in general, as we will see in the 
following section, here we consider the case where mass parameters m R , m^, mo 
are all real, for brevity. The eigenvalues m s , m a and the corresponding eigenstates 
I's, v a of the mass matrix 


are given as 


m s 

m a 

v s 

Va 

where 


M v 


m-L mo 
m D m R 


^{( m R + m L ) + \J{m R - m L ) 2 +4m 2 D }, 

^{-{m R + m L ) + \J{m R - m L ) 2 +4m|,}, 
sin6 v UL + cos 0 „( j / r ) c , 
i{cos6„vl ~ sin 0 v (u R ) c }, 


tan 2 6„ 


2m d 

m R - mL' 


(6.26) 


(6.27) 

(6.28) 

(6.29) 

(6.30) 


(6.31) 


Note that the eigenvalues of M v obtained by an orthogonal transformation are m s 
and -m a , not m a . We, however, have changed the sign, -m a -> m n , by putting 
the extra factor i for the eigenstate v a , so that the eigenvalues m s and m a get 
degenerate for the case of pure Dirac mass, m R = mL = 0, and both eigenvalues 
become positive for the case of “seesaw” scenario. The mass term now has been 
diagonalized: 


L 


m — 




( V a ) c v a + h.c.. 


(6.32) 


Thus, including the kinetic term, the free Lagrangian for the neutrino is now written 
in terms of two Majorana neutrinos obtained by combining v s and u a with their 
anti-particles as chiral partners, N s = u s + (^ s ) c and N a — v a + (^ a ) c , 


C v = \{N s {i<! - m s )N s + N a (i$ - m a )N a }. (6.33) 

In the case of a scalar field, charge conjugation is equivalent to the complex 
conjugation of the field. So a self-conjugate Majorana fermion {ip c = ip) corresponds 
to a real scalar (<p* = <p), and the general mass matrix for two Majorana particles 
just corresponds to a general mass-squared matrix for two real scalars, (pi and fa, 


£ 


0 m — 


1 

2 


( <£ i 



(6.34) 
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Actually for the case of scalar fields what matters is the mass-squared matrix. To 
make the correspondence clear, let us take 


M u Ml = 


m i + m i> 

moimi + m R ) 


mo (mi + m R ) \ 
m R + m D ) ’ 


(6.35) 


instead of M„ itself. It has been well-known that when the masses of two real scalar 
fields are degenerate, m\ = m\ = m 2 , m 12 = 0, they can be equivalently described 
by a complex scalar (p = (1/\/2)(</>i + ip 2 ), 

4ra = -TO 2 p*(f>. (6.36) 


The same thing happens for fermions; for the case of pure Dirac mass, mi, = 
mu = 0, M v Ml has degenerate eigenvalues m 2 D and two Majorana neutrinos are 
equivalently described in terms of Dirac particle, 

Ns — iN a , /a ^\ tc n-7\ 

v D =-y=— = V R + v L (e u = -), (6.37) 

whose free Lagrangian is written as 


C v = vr>{i(j) — m £>) vd ■ 


(6.38) 


The factor 1/2 appearing in the free Lagrangian for Majorana neutrinos (6.32) just 
corresponds to the same normalization factor in the free Lagrangian for real scalars. 
Hence, just as in the case of scalar fields, such normalized Majorana fields N s and 
N a should have the same propagator as the one for a Dirac field. For instance, 
for the Majorana fermion ipMi in (6.21) the propagator without chirality flip, and 
therefore without lepton number violation, is given (in momentum space) as usual, 

R ■ (0\ipMi V’mi |0) ■ L = (0\ipRip R \0) = R ■ — -- L. (6.39) 

p ~ m R 

This suggests that both tpMi and ip mi are properly normalized fields. Thus, gener¬ 
ally the propagator for a Majorana field ipM with a mass to is given as 

(OlVMtMO) = -r- 5 — (6.40) 

P — TO 

(OIV’mV’mIO) = (6.41) 

A characteristic feature of Majorana field is that the propagator, apparently vi¬ 
olating the fermion number, (0 |V'm'^m| 0), exists in addition to the ordinary one, 
(0 |t/’m^m| 0). In (6.41), t stands for the transpose and C = * 7 ° 7 2 is the charge 
conjugation matrix. The propagator of the type (O|i/’at0m|O) is needed in the cal¬ 
culation of e.g. neutrino-less double /9-decay. 
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6.2.2 The Mechanism of Neutrino Mass Generation 

Recent observations of neutrino oscillations strongly suggest that neutrinos have 
finite masses. These data on neutrino oscillations, however, do not tell us the type 
of neutrino masses, i.e. Dirac or Majorana. Though Majorana masses violate lep¬ 
ton number, the smallness of neutrino masses prevent us to confirm or exclude the 
Majorana property by use of the data of the processes sensitive to lepton number 
violation, whose typical example is the neutrino-less double /3-decay. Thus, in the 
consideration of the scenario of neutrino mass generation, we do not have a priori 
any idea of relative strengths of Dirac and Majorana masses. Therefore, let us now 
consider three typical (extreme) cases, depending on the relative strengths of these 
masses. We will find that the smallness of neutrino masses can be naturally ac¬ 
counted for in the third possibility, so-called seesaw scenario. 

(A) Pure Dirac 

We first consider the case that neutrinos have only Dirac masses, mn = mi = 0, 
and neutrinos are Dirac particles. As we have already seen above, in this case a 
Dirac neutrino may be regarded as the superposition of two Majorana neutrinos N s 
and N a with degenerate masses, m s = m a = mo (see (6.27), (6.28)). The mixing 
angle is “maximal”, i.e. 0 V = 7t/ 4, as can be seen in (6.31). This maximal mixing, 
however, does not lead to any neutrino oscillation, though it mimics the flavor or 
generation mixing; as we will discuss, neutrino oscillation necessitates not only a 
mixing angle but also a mass (-squared) difference. In this pure Dirac case, we thus 
need flavor mixings among different generations to realize the neutrino oscillations, 
as will be discussed in the next section. It is interesting to note that though each of 
the two Majorana neutrinos N s , N a does contribute to the lepton-number violating 
neutrino-less double /3-decay, their contributions, obtained by use of the propagator 
(6.41), exactly cancel with each other, due to the factor i in (6.30). This should 
be the case, since in pure Dirac case the lepton number should be preserved. In 
this case, however, we have no special reason to expect that the Dirac masses of 
neutrinos should be much smaller than the Dirac masses of corresponding charged 
particles, such as electron. So a serious problem of why neutrinos masses are so 
small remains to be solved. 

(B) Pseudo-Dirac 

We next consider what happens if we put small Majorana masses, m^, mi -C mo- 
The neutrinos are still almost Dirac particle and are called pseudo-Dirac neutrinos 
(Wolfenstein, 1981). The mixing angle is still almost maximal, i.e. 6„ ~ 7r/4. 
One remarkable feature is that in this case there appears a slight difference of mass 
eigenvalues m 2 s -m? a ~ 2mn(mR + mi). Together with the almost maximal mixing, 
this mass-squared difference leads to a neutrino oscillation even if there is only 1 
generation (Kobayashi and Lim, 2001). In fact a neutrino oscillation vl —>• {vpt) c 
occurs with almost maximal mixing. The probability for the neutrino produced as 
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i 'l at time 0 with energy E to be observed as (vr) c at time t, in the simplified 1 
generation scheme, is known to be given as 

P(,^ (,*)') = (6.42) 

One interesting feature of the neutrino oscillations of pseudo-Dirac neutrinos is 
that the maximal mixing, strongly suggested by the Super-Kamiokande data on the 
atmospheric neutrino oscillation, is almost automatically realized. Such neutrino 
oscillations, however, are those into “sterile” states {vr) c without electro-weak in¬ 
teraction, which are not favored by the data of Super-Kamiokande experiments. 
Again, the problem of smallness of neutrino masses cannot be naturally solved in 
this scenario. 


(C) Seesaw (Yanagida, 1979; Gell-Mann-Ramond-Slansky, 1979) 

We finally consider another extreme case where Majorana masses are much larger 
than the Dirac mass. We, however, find that the Majorana mass for left-handed 
neutrino, tul, cannot be large, while the Majorana mass for right-handed neutrino, 
mu, can be sufficiently large. This difference essentially comes from the difference 
of the transformation property of each Majorana mass term under SU(2)l x £/(l)y, 
the gauge group of the standard model. We immediately know that the right-handed 
Majorana mass term ttir{vr) c vr is gauge invariant under SU(2)i x 17(1), as vr is 
a SU( 2 )l singlet neutral particle. On the other hand, the left-handed Majorana 
mass term mi(Ki) c Vi, is not gauge invariant, as vl belongs to a SU(2)i doublet, 
and this bare mass term is not allowed in the Lagrangian. In fact, we know that the 
product of the fields ( vl) c vl should belong to either singlet or triplet of SU(2)l 
(doublet x doublet = singlet + triplet). Actually it cannot be the singlet, since 
{v L ) c v L behaves as I 3 = 1. Thus {vlY^l should behave as a component of SU{2)l 
triplet, and to form a gauge invariant (renormalizable) Yukawa interaction, we need 
a Higgs Hr belonging to SU(2)l triplet, so that ttil ~ f(Hr) with / being the 
Yukawa coupling. The VEV ( Hr), however, cannot be large. If the VEV is larger 
than or comparable to the VEV of ordinary SU (2) l doublet Higgs, the famous 
relation, well-established experimentally, 


P = 


M? 


w 


Mlcos 2 6 w 


- 1 , 


(6.43) 


no longer holds at the classical (tree) level. On the other hand there is no good 
reason to expect that tur, being SU (2) l invariant, should be small. Rather, it will 
be natural to expect that it is much larger than the VEV of the doublet Higgs, 
i.e. than the weak scale Mw , since it is quite possible that the tur actually comes 
from the energy scale of some “new physics” beyond the standard model, which 
is expected much higher than Mw- The above argument suggests that ttil -C 
ttid rriR. Here, for brevity, let us assume that Ht is absent and therefore 
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rriL = O (tud <C tur). In this case the neutrino mass matrix reduces to 


M v 


0 mo \ 
tur ) ' 


(6.44) 


One immediately knows that one mass eigenvalue of M v , m s ~ ttir under mu C 
tur, and another eigenvalue 

m a ~ < mo, (6.45) 

m R 

since the product of two eigenvalues is given by det M v = -m 2 D . (Strictly speaking 
2 

this means m a ~ with opposite sign, but the extra minus sign can be absorbed 

by putting a phase i to the eigenstate u a , as is seen in (6.30).) These eigenvalues 
may be obtained from (6.27) and (6.28), as well. We thus have obtained a naturally 
small (<C mo) neutrino mass m a by making m R much larger than me. The relation 
m s ■ m a ~ rn 2 D suggests the terminology seesaw. Namely, when one eigenstate N s 
gets heavy another eigenstate N a is “lifted”. We know that under me « tur the 
mixing angle 0 V is small 

e v ~ ^«1 (6.46) 

m R 

N s ~ Ur + ( u R ) c , N a ~ i{u L - {u L ) c }. (6.47) 


As the heavier Majorana neutrino N s decouples from lower energy processes, only 
N a ~ i{uL — ( ul) c } will remain and there will be no neutrino oscillation in one 
generation scheme. Thus, as in the case of pure Dirac scenario, to have neutrino 
oscillation, flavor or generation mixing is inevitable. 

One may wonder, in the absence of the triplet Higgs Hr, which operator can 
provide the small Majorana mass m a , which is effectively the Majorana mass for 
the left-handed neutrino, as N a ~ - (ur) c }. It turns out that though there 

is no renormalizable operator in the original Lagrangian relevant for the Majorana 
mass, a higher dimensional (d = 5) “irrelevanth effective operator, 


tfecract) ■ ( 1 4 e l L )ea a C ^ ^ ^ , (6.48) 

plays the role, where <fi = (<p + , ip 0 ) 1 is the Higgs doublet and e is 2x 2 anti-symmetric 
matrix (e 12 = —e 21 = 1) and C is the charge conjugation matrix. The coefficient 
c is a dimensionless coupling and if we regard c = fp {fo- Yukawa coupling in 
the Dirac mass term), M = m R and replace <p° by the VEV v/\/2, we get a left- 
handed Majorana mass, ~ ~ , in accordance with the result obtained by 

the diagonalization of the mass matrix. In fact, the diagram shown in Fig. 6.1 is 
known to provide (a part of) the effective higher-dimensional operator, when the 
momentum is ignored compared with tur in the propagator of the internal line. 
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Fig. 6.1 


When M goes to infinity the effective operator vanishes. Thus the essence of 
seesaw may be understood as the decoupling phenomenon of a heavy particle with 
mass M, which is right-handed Majorana neutrino N s in the case of seesaw mech¬ 
anism. The importance of the argument in terms of the effective operator is that 
the M needs not to come from the right-handed Majorana neutrino, but may be 
attributed to any heavy SU(2) singlet field with mass M. 


6.3 Flavor Mixing and Neutrino Oscillation (in the Vacuum) 

If we include right-handed neutrinos to accommodate neutrino masses, the lepton 
sector is described by 3 doublets and 6 singlets of SU (2)/.: 

(6.49) 

Without loss of generality, the charged leptons, ej i, cr etc., may be understood as 
their mass eigenstates, and their partners in the doublets, v e L etc. are states emitted 
by weak interaction processes and are called “weak eigenstates”. Suppose the mass 
matrix for neutrinos in the base of weak eigenstates is diagonal. (For brevity we 
here assume neutrinos have only Dirac masses), Then, the generation number or 
quantum number of flavor such as electron number L e ( L e = 1 for e~, v e , L e = 0 for 
others), muon-number etc. are strictly conserved. If, on the other hand, the mass 
matrix is non-diagonal, the flavor changing processes such as u e -> or n -A ej 
will become possible. The phenomenon such as v e j/ m , where a neutrino of some 
specific flavor is transformed into a neutrino of another flavor, is called “neutrino 
oscillation”, since the probability of the transition oscillates as the function of time. 
Even if we switch on flavor mixing, provided the neutrino masses are all vanishing 
or, more generally, degenerate, the amplitudes of flavor changing processes will 
exactly vanish. This can be explicitly checked in the formula of neutrino oscillation 
probabilities, as we will see later (see for instance (6.69) for A= 0). This fact 
also may be easily understood from a symmetry argument. When neutrino masses 
are all degenerate, there appears a global symmetry U (3) among 3 generations or 
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VfiL 
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VtL \ . 
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flavors in neutrino sector, and by use of Noether’s theorem, we easily find that 
there should be 3 conserved additive quantum numbers, which may be interpreted 
as L e etc. Or, more intuitively, if neutrino masses are all degenerate there would 
be nothing to distinguish neutrino flavors, and we may always perform a unitary 
transformation, so that all flavor mixing angles vanish in the new base. Thus to 
realize flavor changing processes we need both flavor (or generation) mixing and 
non-degenerate neutrino masses. This is in complete similarity with the content of 
GIM mechanism in quark sector. 

It should be emphasized that now there are increasing evidences for neutrino os¬ 
cillations claimed by the experiments to detect the solar and atmospheric neutrinos, 
as will be discussed later in this chapter. The phenomenon of neutrino oscillation 
is quite interesting by its own right. In addition, theoretically, it is expected to 
play a central role in the search for and the establishment of “new physics”, since it 
clearly indicates non-vanishing neutrino masses and therefore some physics beyond 
the standard model. 

A nice comprehensive review on the neutrino masses and neutrino oscillations is 
given, e.g., in the recent textbook by Fukugita and Yanagida (Fukugita-Yanagida, 
2003). 


6.3.1 Flavor Mixing 

We now discuss the flavor mixing, the origin of neutrino oscillation, for two typical 
scenarios, (A) pure Dirac, and (B) seesaw. 

(A) Flavor mixing in the pure Dirac scenario 

Just as in quark sector, if neutrinos have only Dirac masses, the mass term for weak 
eigenstates (v eL , v t l) and {v eR , u^ R , v tR ) is given as 

C m = (m D )a 0 vn:v 0R + h.c. (a,0 = e,f.i, t). (6.50) 

Though a Dirac fermion can be decomposed into two Majorana fermions with de¬ 
generate masses as we have already seen in the previous section, we do not take 
this picture, and the mass matrix is 3 x 3 matrix just as in the quark sector. As 
we have learned in the quark sector (see (4.144), (4.145)), the matrix mo can be 
diagonalized by a bi-unitary transformation, 

/ mi 0 0 \ 

U^moV =1 0 m 2 0 I , (6.51) 

\ 0 0 m 3 / 

where U and V are unitary matrices. The left-handed eigenstates of mass matrix, 
i.e. mass-eigenstates, v 2 l, l with Dirac masses, mi, m 2 , m 3 are related to 
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weak eigenstates as 


VeL \ 


( ^ \ 


^L I 

= U- 

i V2L ■ 

(6.52) 

VtL ) 


V "ZL } 



Accordingly the charged current interaction of left-handed leptons is written in 
terms of U 


£c 



( e L pL L tl )U 


VlL 

V21 

VZL 


■ W", 


(6.53) 


and the unitary matrix, corresponding to the Kobayashi-Maskawa matrix in quark 
sector, is called Maki-Nakagawa-Sakata (MNS) matrix (Maki, Nakagawa and Sakata, 
1962). In the 517(2)/, x U(\)y gauge theory, only left-handed neutrinos have weak 
interactions, and as we will see in the following section, only neutrino oscillations 
without chirality-flip are important. These lead to the conclusion that not mo itself, 
but only the combination mum}) is relevant for the physical processes, including 
neutrino oscillation. Since 


mam[| = U dia,g(ml,m? 2 ,ml) U\ 


(6.54) 


the unitary transformation of right-handed neutrinos V never appears in the phys¬ 
ical observables. 


(B) Flavor mixing in the seesaw scenario 

Now with 3 generations the seesaw mechanism should be implemented in the 6 x 
6 mass matrix. In the base of ip = ( u eL , u^l, i ' t l, ( v e R) c , (v^r) 0 , {vtrYY, the 6 
x 6 mass matrix m„ takes the form of 




mi 

m l D 

mo 

m R 


(6.55) 


where 3x3 matrices mi, m R , mo denote left-handed, right-handed Majorana 
mass matrices and Dirac mass matrix, respectively. As we have already seen in the 
simplified 1 generation scheme, the mass matrix m„, as the whole, is a symmetric 
matrix, m * = m„ {m l L = = hir). To consider the seesaw scenario, we 

assume that mi = 0. The mass matrix with mi = 0 is easily achieved in the 
standard model, by introducing right-handed neutrinos and their Yukawa couplings 
and bare mass terms, 

fa(3 ( ^aL l« L ) ~ (m R ) a 0(u aR ) c U 0 R, (6.56) 


where ( mo) a s = fpa ' 75 > ^ — *^ 2 <i>* (see (4.137)) and ^ is the VEV of the 
neutral Higgs ip°. To account for the seesaw mechanism we understand that all 
matrix elements of m R are much larger than those of mo- Then, as we have seen in 
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the 1 generation scheme, the mutual mixings among v a L and (v aR ) c (corresponding 
to 6 U in (6.29), (6.30)) are small, being suppressed by the ratios of elements of mo 
to those of tur. Just as in the 1 generation scheme, the mass matrix is made into the 
form of block-diagonal by a suitable unitary transformation with the small mixing 
angles, 


( _ U ~im dTUr 1 A mv ( H_ rn ^D rn R 

\ m R 1 m* D I ) u \ ~im* R l mo I 

( m oK)" lm D 0 A 
V 0 m* R ) 


(6.57) 


Thus again eigenstates with masses of the order of m R , decouple from the low 
energy effective theory, and the mass matrix for the remaining neutrinos, which are 
approximately v a L, may be regarded as 


m vL = m^m^) 1 m D , 


(6.58) 


which is again a symmetric matrix. 

It is mathematically proven that a symmetric complex matrix can be diagonal¬ 
ized with real eigenvalues by a unitary transformation, 

— diag(mi, m 2 , m 3 ). (6.59) 

Again what matters in the neutrino oscillation is not itself, but 

m ], L m uL = U 17*. (6.60) 

It is quite important to note that the right-hand-side of the equation is exactly 
the same as the corresponding one in pure Dirac scenario, (6.54). Thus as far as 
our concern is neutrino oscillation, to discriminate the scenarios of neutrino mass 
generation, pure Dirac or seesaw, is impossible. Note, however, if we observe a 
process with chirality-flip, vr -> vr or vr —> (i 'r) c , it is in principle possible to 
discriminate these two scenarios, since the final states vr and {vr) c have opposite 
lepton numbers and only ( rr) c has weak interaction. The typical and interesting 
example with chirality flip and lepton number violation is the neutrino-less double 
/3-decay. If the process is observed, it will clearly indicate the Majorana property 
suggested by the seesaw scenario, though the decay amplitude is suppressed by the 
small Majorana neutrino masses of vr, and the detection is not easy. 


6.3.2 Neutrino Oscillation (in the Vacuum) 

Now let us move to the discussion on the neutrino oscillation in the vacuum. (Neu¬ 
trino oscillation in the presence of matter will be discussed in the next section.) 
The neutrino oscillation is known for long time to be the most natural framework 
to explain the lower than expected capture rates of solar and atmospheric neutrinos, 
i.e. the puzzles of solar and atmospheric neutrinos, which we will discuss later in 
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this chapter. We have argued that neutrino oscillation is possible, for pure Dirac or 
seesaw scenarios, only if there are more than 1 generation of leptons. Strictly speak¬ 
ing even in 1 generation scheme, there exists an “oscillation” of the type “ vr -A vr” 
with chirality flip. This seems to be able to explain, just with 1 generation, the puz¬ 
zle of e.g. solar neutrino, since the transformed state vr has no weak interaction 
(a “sterile” state) and escapes the detection. However, unfortunately this is not 
the case. As the neutrino masses are known to be quite small, neutrinos are highly 
relativistic under the real situation of experimental detection, i.e. m v C E v . Since 
for the relativistic particle the chirality is nearly equal to the helicity and therefore 
is almost preserved, the probability of the transition vr -a v r is quite small, 

P{vl -A Ur) ~ (-^r) 2 1. (6.61) 

The fact that the chirality flip is negligible suggests that in the consideration of 
neutrino oscillation the spin degree of freedom is not relevant. (If we take the ef¬ 
fect of background magnetic field into account, which may be suitable for neutrinos 
propagating inside the sun or supernovae, the spin-precession of the neutrinos due 
to the anomalous magnetic moment may take place without the chiral suppression 
factor {mv/Ev) 2 (Voloshin et al., 1988; Lim and Marciano, 1988; Akhmedov, 1988), 
and this statement does not hold. We, however, ignore here the effect of the mag¬ 
netic field.) Thus the relevant equation of motion for neutrinos is the Klein-Gordon 
equation, rather than the Dirac equation. We have already discussed that when 
the chirality flip can be ignored there is no essential difference, concerning neutrino 
oscillations, in the two cases of pure Dirac and seesaw. Thus we now assume that 
neutrinos are pure Dirac particles. The mass eigenstates un (i = 1,2,3) obey the 
following Klein-Gordon equation 

(□ + m?) Vi = 0, (6.62) 


whose solution with definite momentum and (positive) energy is just an ordinary 
plane-wave solution 


Vi = = e~ iEit ■ e ips , 


(6.63) 


where Ei = \Jp 2 + rn'j and the 3-momentum p is taken to be common for all V{ 
for convenience. For relativistic neutrinos, |p| 3> m t , the approximation Ei ~ \p\ + 
(m?/2|p|) ~ |/7| + (mf /2E) (E: the average neutrino energy) is valid. Furthermore, 
the factor e~ i ^ t • e ip ’ s is an overall phase factor, common for all neutrinos, and 
does not affects the physical observables. Thus if we regard the matter waves of 
neutrinos as Vi(t) = e ~ ii ' m P 2E \ they obviously satisfy the following equation of 
motion 



IE 

0 


2 E 


V o o 



(6.64) 
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As neutrinos are emitted and observed as weak-eigenstates, not as mass-eigenstates, 
it will be useful to consider the time-evolution equation in the base of weak- 
eigenstates. The (6.52), (6.54) tells us that the differential equation is obtained 
by a unitary transformation with the MNS matrix U: 



U 


/ m i 

' 2 E 

0 

V o 


o \ 


2 E 
0 


■ f/t 


2 E 




(6.65) 


As we have seen above this expression holds for both of pure Dirac and seesaw 
scenarios. This is why we cannot discriminate these two types of neutrino mass 
generation just from the data of neutrino oscillations. The above evolution equation 
is readily solved to be 


/ v e (t) \ . f i/ e (0) \ 

"/*(*) = exp(-—m^m+t) • M°) @ 

V Mt) J \ *(0) J 



( 

2 

0 

0 ^ 

1 

' ^e(0) \ 


u 


0 

e-^ 

o 

W 

^(0) 

, (6.66) 


\ 

0 

0 

e-^t j 

\ 

< Vr(0) J 



where a relation exp(- = ex p{~ jeU m 2 ^^UU) 

= Uexp{-^m 2 dia& t)W ( m ^ ia g = diag(m?,mSj,m§)) has been used. 

Suppose a neutrino is born as v a at time 0. The initial condition is then given 

as i'a(O) = 1, others = 0. Under this initial condition, the probability amplitude 

m ? 

to detect v 0 at time t becomes v 0 (t) = U* t . Thus the probability 

for the neutrino born as v a at time 0 to be observed as up at time t is given by a 
formula (see Fig. 6.2) 


P{v a -i u 0 ) 


| £ U 0i e-^* U*i \ 2 

i 

| Y, U 0 i e- 1 ^ U*i\ 2 . 

i 


(6.67) 


In the second line of (6.67) A m 2 ^ = rrif — m 2 (Am a = 0) and an overall phase 
factor has been ignored. This formula clearly shows the fact that to get neutrino 
oscillation both flavor mixing described by U and the mass (-squared) differences 
Am 2 are necessary. In fact, if A= Am^ = 0 then P(v a —> vp) — 0 for 
a ^ P due to the unitarity {UU‘ r )p a = 0. More intuitively, the neutrino oscillation 
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u* 



Upi V 0 


Fig. 6.2 


is purely quantum mechanical effect and may be understood as a sort of “beat” 
of (coherent) matter waves describing mass eigenstates Vi. The beat, of course, 
needs slight differences of frequencies, i.e. slight mass differences of i/j (for a fixed 
momentum p to realize the coherence). Though we have supposed that there are 3 
generations, actually the above formula is valid for arbitrary number of generations. 

One remark here is concerning CP violating phases. In the case of pure Dirac 
neutrinos the physically meaningful degrees of freedom of CP violating phases in 
the MNS matrix U is ( ra ~ 1 ^ n ~ 2 ) (n : the number of generations), just as in the ar¬ 
gument of Kobayashi-Maskawa matrix in the quark sector. In the case of Majorana 
neutrinos suggested by the seesaw scenario, however, it is known that the argument 
needs some modification. Namely, the re-phasing of (left-handed) neutrino states 
in order to eliminate some phases in U defined in (6.59) is no longer possible, while 
the re-phasing of charged leptons goes just as in the quark sector. This is because 
the Majorana mass term for mass-eigenstates iq, —\rrii (i/n ) 0 ViL > is not invariant 
under the re-phasing vn vu, as the mass term violates U(l) symmetries, 

especially the lepton number, in general. Thus the number of physically meaning¬ 
ful CP-violating phases is modified into n 2 - — n = . The difference 

”^ n 2 ~ x ^ — ( n " 1 K n ^ jl — n — 1 denotes the number of newly added CP phases for 
Majorana neutrinos and these phases are called Majorana-phases. Thus even in 
2 generation we have CP violation, in principle. Such Majorana-phases, however, 
does not appear in neutrino oscillation processes. This is simply because the dif¬ 
ference between pure Dirac and seesaw (or Majorana) scenarios does not manifest 
itself in the neutrino oscillations. The essence of this statement is that even if there 
may appear a phase e 2t ^ m; by the re-phasing of neutrinos, what matters in the 
oscillation probability is the combined factor (e 2, ^mi)(e 2, ^mi)* = m 2 , not the 
mass itself. Recall that the neutrino oscillation we are considering is the one with¬ 
out chirality-flip. This suggests that the Majorana phases are really meaningful in 
the processes with chirality flip and the violation of lepton number, whose typical 
example is neutrino-less double /3-decay. In fact, the factor i appearing in (6.30) is 
a sort of Majorana phase, and we have seen there that the phase played a role to 
cancel the amplitude of the neutrino-less double /3-decay in the pure Dirac scenario. 

Though there exist 3 generations, in reality, for illustrative purpose let us discuss 
for a while the simplified 2 generation scheme, assuming that v a = v e , i/ M and 
Vi = v\, V 2 - We may assume without loss of generality that the mixing matrix U is 





Flavor Mixing and Neutrino Oscillation (in the Vacuum) 


121 


real orthogonal matrix, as there does not appear any CP violating phase for n — 2: 

TT ( cos # sin 6 \ . 

U = . 6.68 

\ —sin 9 cos 9 J ' 

In this case, the mass-squared difference is unique. Then (6.67) tells us that 

the transition probability of the process u e —> reads as 


Am 2 

P{y e —> Vu) = sin 2 2# sin 2 (——— t). 

4E 


(6.69) 


As we expected, P(v e —1 v^) = 0 for 9 = 0 or Am^ = 0. On the other hand the 
“survival probability” of v e is given as 

A 

P{v e -A v e ) = cos 4 # + sin 4 # + 2cos 2 #sin 2 #cos( £) 

2 E 

= 1 - sin 2 2# sin 2 ( ^™ 21 t). (6.70) 

Ah 

These probabilities satisfy a relation 

P{v e -4 v e ) + P{v e -l v^) = 1, (6.71) 

which reflects the conservation of probability. 

For an arbitrary number of generations, the CPT theorem implies 

P(v a -4 up) = P{vp -4 T£). (6.72) 

If CP is an exact symmetry, we further get 

P(v a -4 vp) = P(v^ -l vp). (6.73) 

If T is an exact symmetry, we get 

P(v a -4 vp) = P[vp -l v a ). (6-74) 

(6.73) is obtainable from (6.72) and (6.74) as the CP-invariance is equivalent to 
T-invariance, under the CPT theorem. In the two generation case there is no CP 
violating effect, and all of these relations hold. (These relations are spoilt, in general, 
once the matter effects, discussed in the next section, are taken'into account.) For 
instance, 


P (-> v e ) = P{v e -l i/ M ), 


(6.75) 


and therefore the unitarity for v e , (6.71), and the corresponding relation for v^ 
imply 


P(l/ M -4 Vp) = P(v e -4 v e ). 


(6.76) 


In realistic experimental situation, it is quite possible that the wave length of 
the neutrino oscillation is much smaller than the uncertainties of the positions of 
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production and detection points of the neutrinos. If it is the case, the relevant 
quantities are time-averaged (the average over the positions is equivalent to the 
time-average) transition or survival probabilities. For instance, in the 2 generation 
scheme, we have the time-averaged probabilities denoted by P, 

P(u e -4 Vp) = ^sin 2 20, (6.77) 

P(u e -4 v e ) = 1 - ^sin 2 2$. (6.78) 

Though the survival probability (6.70) itself can be sufficiently small for 9 ~ 7r/4, 
the time-averaged probability has a lower bound, 

P(u e -4 v e ) > (6.79) 

This can be easily generalized to the scheme of arbitrary number of generations n : 

P{y e -4 u e ) > — . (6.80) 

n 

To prove this inequality, we first notice that when time-average is taken, 6.67 pro¬ 
vides us 

?(*« ^ «*) = ]pqw| 2 |tf«i| 2 - Ml) 

i 

In particular, the averaged survival probability of v a is simply given as 

= E Fail 4 - (6.82) 

i 

Then the inequality is easily proven by use of the unitarity condition of the MNS 
matrix, ]P i \U a i\ 2 = 1. The square of this unitariry relation yields 

E |f/ ai -| 4 + 2 E \U a i\ 2 \U a j\ 2 = 1. (6.83) 

i i<j 

On the other hand, we have a trivial relation 

E(M - \U aj \ 2 ) 2 > 0, (6.84) 

ij 

which gives 

(n - 

Summing up (6.83) and 

T\u°i\ 4 >-- ( 6 - 86 ) 

“ n 


i)(E i^-i 4 ) - 2 E \ u «i\ 2 \ u °j\ 2 > ( 6 - 85 ) 

i i<j 

(6.85), and dividing by n, we get what we need 
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The inequality P(i/ e -4 v e ) > \ is the simplest example in the 2 generation scheme. 
We further note that the equality in the above relation, i.e. \U a i\ A — ^ is real¬ 
ized when the equality in the (6.84) is met: |£/ a i| 2 = \U a 2 1 2 = = \U an \ 2 , namely 

in the case of “maximal mixing”. 


6.4 Resonant Neutrino Matter Oscillation 

We have seen in the previous section that the time-averaged survival probability 
never gets lower than 1/n (n : the number of generations). Thus, in the 3 genera¬ 
tion scheme it is impossible for the vacuum oscillation (neutrino oscillation in the 
vacuum) to account for data which indicates the survival probability lower than 
1/3, such as the one of the pioneering solar neutrino experiment by R. Davis and 
collaborators. Furthermore to realize the minimum value 1/3, a fine tuning of the 
mixing angles is necessary so that |C/«i| 2 = \U a 2 \ 2 = \U a y,\ 2 is realized. 

Such difficulty may be overcome, once we invoke to a resonant neutrino os¬ 
cillation due to the interaction of neutrinos with matter inside the medium, e.g. 
the sun. Before discussing the detail of the “resonant matter oscillation” it will 
be useful to note that the matter effect (the interaction of neutrinos with matter) 
leads to an additional potential energy V(x) only for v e . The V(x) then causes 
the change of the frequency of the matter wave of v e , depending on the coordinate 
x , and therefore time t (x ~ t), for a highly relativistic light neutrinos, while the 
frequency of the matter wave of another flavor, e.g. in 2 generation scheme, 
stays a constant. Thus it is quite possible that at some point, or equivalently at 
some time, the frequencies of these two matter waves just coincide, and a resonance 
phenomenon occurs. If we start from the matter wave of i/ e , at the resonance point 
almost complete transition of the matter wave from v e to is possible, even for 
small mixing angle 9, as far as some suitable condition is met. It is analogous to the 
case of two tuning-folks of musical instrument, the length of one of these two folks 
being variable. When the lengths of two folks coincide the vibration of one folk 
may be resonantly transferred into that of another folk, almost completely. One 
important thing is that for the transfer to be efficient the variation of the length 
should be slow enough, i.e. an adiabaticity condition should be met. The role of 
the mixing angle 6 is played by the air (!), which mediates the sound connecting 
the vibrations of the two tuning folks (see Fig. 6.3). 

Now let us confirm the above intuitive argument really holds by use of math¬ 
ematical formulae. First let us calculate the potential V{x). Since neutrino os¬ 
cillation is caused by the interference effect (beat) of coherent matter waves with 
a fixed direction of momentum p, the potential V(x) should be attributed to the 
elastic forward scattering of neutrinos off electron, proton and neutron inside the 
medium, due to the weak interaction. V (x) changes the dispersion relation between 
|p| and E of neutrinos, and therefore the index of refraction of neutrinos. Such elas- 
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Fig. 6.3 



Fig. 6.4 

tic scattering with protons and neutrons are possible only through neutral current 
processes (Fig. 6.4(a)). In the case of neutrino interaction with electron, however, 
in addition to the neutral current process, only v e has a charged current process 
(Fig. 6.4(b)). The matter effect due to the neutral current process via Z-exchange, 
V n (x) is universal for all of three neutrinos. This V n (t) provides an overall phase 
factor exp (—i V n (t')dt') for all neutrinos. The universal phase has no physical 
consequence, since neutrino oscillation is caused by superposition of multiple matter 
waves with slightly different frequencies, i.e. by the “beat” of matter waves. The re¬ 
maining charged current process only for v e seems to be a scattering process, rather 
than providing a potential. It, however, turns out that it can be interpreted as a 
process to give the potential. The charged current process, Fig. 6.4(b), provides an 
effective 4-Fermi contact interaction Hamiltonian (for neutrino energies E < Mw) 

Gf 

■ Velnil ~ 7s)e • e 7 "(l - 75 H- (6.87) 

The (V - A) x (V — A) type 4-Fermi coupling is known to be rewritten by use of 
Fierz transformation as 

• FeT7i(l ~ Ts)^ • e 7 M (l — 7 5 )e, (6.88) 
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which is practically equivalent to the neutral current process. Let us recall that in 
QED, a background electric Coulomb potential V (a;) interact with electron as e 7 oe x 
V{x). Similarly, the above 4-Fermi interaction may be regarded as an interaction 
of the left-handed v e with “Coulomb” type “static” potential V c (x) 

v e Lj 0 v eL ■ V c {x), (6.89) 

V c (x) = V2G F N e (x), (6.90) 


where e 7 °e is identified with the electron number density N e (x), and other com¬ 
ponents, e 7 *e (i = 1 , 2 ,3),e 7 M 7 se have been neglected as they are expected to be 
proportional to the expectation values of velocity or spin of the electron and may 
be neglected for static and un-polarized medium of electron. Let us again recall 
that under the influence of 4-potential A M = (V,A) the dispersion relation be¬ 
tween momentum and energy of v e is modified into E — V c = y (p — A) 2 +m 2 or 

E = \J (p — A) 2 + to 2 + V c . Thus concerning the static un-polarized medium, only 
u e i gets additional potential energy V c , and corresponding change of the energy. 
Thus the time evolution equation, “Schrodinger equation” in the base of ( 17 , , v T ), 

(6.65) is accordingly modified into 



0 

Amj, 
2 E 


0 


0 

0 

Am^| 

2E 


\ 


■ u' + 


J 


a(t ) 0 0 

0 0 0 

0 0 0 



(6.91) 


where the matter effect has been indicated as a{x) = y/2G F N e (x), instead of V c , 
and the replacement mf Arofj has been done ignoring an overall phase associated 
with m\!2E. 

Though we should treat the realistic 3 generation scheme, such treatment is com¬ 
plicated, generally speaking. As we will discuss in the next section, fortunately there 
is an approximate reduction formula, valid under the hierarchy of mass-squared dif¬ 
ferences AAm^j, which enables us to reduce the problem in 3 generation 
scheme to that in an effective 2 generation scheme. Therefore, we will deal with the 
matter oscillation in 2 generation scheme with a set of generic parameters of one 
mass-squared difference and one mixing angle (Am 2 ,9), in this section. Such argu¬ 
ment will be useful, even in the realistic 3 generation scheme, since the reduction 
formula is valid. 

In the simplified 2 generation scheme, by use of (the simplified version of) (6.91) 
and ( 6 . 68 ) the time-evolution equation can be explicitly written (after subtracting 
a common factor 1 sin 2 # at the diagonal elements) as 


i*L ( Ve \ - ( G F N e (t) AzAisin 20 \ / Vg \ 
dt\u M ) ^ ^#sin2 9 ^cos2 9 )\v») 

Since the “Hamiltonian” H(t), the 2x2 matrix, is time dependent, 


(6.92) 
the above dif- 
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ferential equation can not be solved analytically, unless the N e (t) has some specific 
time dependence. However, in an appropriate circumstance, we may still get a sim¬ 
ple analytic formula for the neutrino oscillation. Namely, as we have seen in the 
analogous case of the tuning-folk, as far as the time dependence of a time variant fre¬ 
quency is mild (the exact meaning of this will be defined shortly), almost complete 
conversion of one matter wave to the another should be possible. The condition of 
the “mild time dependence” is called “adiabaticity”. The resonance phenomenon, 
which occurs when the two frequencies become identical, should happen in this sys¬ 
tem of two neutrinos when the two diagonal elements of H(t) becomes the same, 
namely at the point where the following condition is met, 

A 2 

V2G F N e {t) = ~ ^-cos20 = y/2 G F N er . (6.93) 

2E 

To see these things explicitly, we will move to a (time-dependent) base of neutrino 
states, where the Hamiltonian H{t) is diagonalized by a time-dependent unitary 
transformation: 


Um(t)'H(t)U m (t) = diag (E 1 (t),E 2 (t)), (6.94) 

its) < M5 > 

U) = *-«>■(£)■ (6 - 96) 


where the time-dependent energy eigenvalues and mixing angle are given by 


i / Am 2 

£i,2(0 = -ly/2G F N e (t) + —ca*20 


(V2G F N e (t) - ^cos2 Of + (~sin20) 2 


2 E 


2 E 


tan20 m = 


2 E 


•sin2# 


^-cos20 - V2G F N e (t)' 


(6.97) 

(6.98) 


Fig. 6.5 shows how the mixing angle inside the matter 8 m changes as N e varies. 

We see that the resonance point, satisfying N e (t) = N er , is the point where the 
conversion of matter wave is most efficient and therefore the mixing 6 m is maximal, 
i.e. 


0 m = (6.99) 

We may also say that the 6 m changes most rapidly around the resonance point, and 
the “resonance region” can be defined as to be a region where |tan20 m | > 1. 

Actually, although we have diagonalized H(t ), the evolution equation is still a 
coupled differential equation, since the unitary transformation by U m (t ) is time- 
dependent. That is why we cannot solve the differential equation analytically. 
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Fig. 6.5 


Namely, we get a residual term with off-diagonal elements in the evolution equation 
in the base of (v m u F m2 ) 4 , 

'!(£)-{( T *<,)) + ( X t )}•(::;) ■ < 6 ' ioo > 

Then what is a merit of working in this base? Well, in this base the time evolution 
in the adiabatic case, the case where the adiabaticity condition is met, is easily 
solved. In fact if the condition 

|0m| « |£ 2 -£i| (6.101) 

is always satisfied, the residual term in the Hamiltonian can be safely ignored, and 
the system is approximately diagonalized: 


■ d_( Vml \ ( Ei(t) 0 \ f Vml \ 

dt\v m2 )~\ 0 E 2 {t) ) V v m 2 ) ' 


( 6 . 102 ) 


The adiabaticity condition will be the most non-trivial at the resonance point, where 
the change of 9 m is maximal and the difference of energy eigenvalues is minimum, 
\E 2 — Ei \ = ^rg-sin2 9. Thus the adiabaticity condition may be written as 


tan2# E 

d log/v,, | ^ A m 2 sin20’ 

dx \ res 


(6.103) 


where 9 m has been written in terms of d ^°f~ ) which should be evaluated at 
the resonance point. The physical meaning of this adiabaticity condition is that 
the wave length of the “beat” at the resonance point, i.e. ~ l/\E 2 — E\\ = 
S/(Am 2 sin20), should be much smaller than the width of the resonance region, 


i.e. (Amlsm 2 0 )/( V2G F ^|re s) = tan2 0/ ( ii°fp| res) . 
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The above differential equation is easily solved to yield 

= exp (-if Ei(t')dt') ■ i/ m i(0), 
Jo 

v m 2 (t) = exp(-i [ E 2 (t')dt') ■ i/ m2 (0). 

Jo 

Thus the survival probability is given as 
P{v e -A u e ) 

/-t 2 

= ( v e \ T exp(—i / H(t')dt')\u e ) 

Jo 


(6.104) 

(6.105) 


( cos 6 m (t) sin 9 m (t) ) | 6XP( ' lI ° Q El(t )dt) 


exp(—i fo E 2 (t')dt') 


+ 


cos9 m (0) 
sin0 m (O) 

cos0 m (f)cos0 m (O) exp(-i f Ei(t')dt') 

Jo 

sin0 m (f)sin0 TO (O) exp(-i f E 2 (t')dt') 

Jo 


(6.106) 


If an average is taken over the detection time t, the interference term may be ignored 
and the formula simplifies into 


P(i/ e -> v e ) = cos 2 0 m (f)cos 2 0 m (O) + sin 2 0 m (f)sin 2 0 m (O). 


(6.107) 


If the matter effect is ignored, i.e. if N e = 0 and 9 m (t) = 9 m ( 0) = 6 , this reduces 
to (6.78) for the vacuum oscillation. 

We now realize that even if the mixing 6 is small, almost complete conversion 
of a solar neutrino, starting as u e , into v tl at the solar surface is possible, so called 
MSW effect (Mikheyev and Smirnov, 1985; Wolfenstein, 1978). Namely, setting 
0 m (t) ~ 6, 9 m ( 0) — f and assuming 7V e (0) N er , the above formula is simplified 
into 


P{v e —> v e ) ~ sin 2 0, (6.108) 

which can be sufficiently small even for small mixing 9, in clear contrast to the case 
of (6.78), due to the resonance phenomenon. Such effective adiabatic conversion 
mechanism between two states, having level crossing, is visualized in Fig. 6.6. 

In Fig. 6.6, the dashed lines indicate the behaviors of the two diagonal elements 
of H(t) as the functions of N e . The solid curves correspond to the two eigenvalues 
E\(t) and E 2 (t). If v e is produced at a point with N e ( 0) N er , v e state is approx¬ 
imately on the upper solid curve, as the state v e is approximately an eigenstate of 
H( 0) with dominant energy due to the matter effect. As the matter density and 
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Fig. 6.6 


therefore N e decreases, e.g. as v e traverses from the center toward the surface of 
the sun, the state evolves along the upper solid curve, as long as the adiabaticity 
is met, and after passing through the resonance (level crossing) point, it will follow 
the upper dashed line, to end up as the v^ state. 

In the case that the adiabaticity condition is not satisfied, the above formula 
should be modified. In this case at the resonance point or at the level crossing, 
there should be a “jumping” between v m \ and v m 2 states, i.e. between the two 
solid curves. Let the probability of the jumping be Pj U mp ■ Then the two states 
interchange by a probability Pj U mp and remain to be the same by a probability 
1 — Pjump ■ Now the survival probability reads as 

P(u e -> v e ) = (1 - Pjump) (cos 2 0 m (t) cos 2 # m (0) + sin 2 # m (f) sin 2 # TO (0)) 

+ Pjump (sin 2 # m (t)cos 2 # m (0) + cos 2 # m (f)sin 2 # m (0)) 

= - +Pjump)cos26 m (t)cos26 m (0). (6.109) 

Setting # m (f) ~ #, # TO (0) ~ the formula reads as 

P(u e -> v e ) ~ sin 2 # + Pjump cos 2 #. (6.110) 


This is based on the reasonable assumption that the jumping takes place only 
near the resonance point. Then it will be a meaningful way of doing to replace 
the matter effect by its linear approximation so that they coincide at the resonance 
point. Such a replacement makes it possible to use the Landau-Zener formula for a 
level crossing (Landau, 1932; Zener, 1932), leading to 


jump 


= exp(- 


7rAm 2 sin 2 2# 
d log 


4Ecos2# a -^t^\rs 




( 6 . 111 ) 
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Fig. 6.7 


where the derivative 


d lQ g 

dx 


should be taken at the resonance point. 


In this way, substantial conversion of v e is possible by the MSW effect for solar 
neutrinos, as the typical example of the resonant matter oscillation. When the 
survival probability P(v e -A v e ) is fixed by a experimental data, it will determine a 
curve in the 2-dimensional space of the parameters, ,Am 2 ). When P(v e —► 

u e ) < 1/2, the curve is known to draw a closed triangle (roughly speaking) in the 
log-log plot of the parameters shown in Fig.6.7, which we call MSW triangle. 

The three sides of the triangle have their own physical meaning. Namely, the 
horizontal size represents the parameters, for which the level crossing starts to 
occur, JV e (0) ~ N er = with E being an average neutrino energy. For 

E = 10(MeV) and N e (0) = 100(p/cc), Am 2 ~ 10“ 5 (eV 2 ). The vertical line is 
given by a relation sin 2 # = observed survival probability, i.e. by use of (6.108). 
For the parameters on this line, all neutrinos with different energies experience the 
level-crossing and satisfy the adiabaticity condition. The diagonal line is given by a 
relation Pj um p = constant, i.e. by Am 2 • 4 £cos 29 = constant. This line separates the 

adiabatic (upper) and non-adiabatic regions, since the factor Am 2 • 4 ^?os:L inside 
determines the adiabaticity of the transition. This MSW triangle plays an 


■ jump 


important role when we derive the allowed parameter region from the data on solar 
neutrinos, as we will see briefly later in this Chapter. 


6.5 Neutrino Oscillation in the Three Generation Scheme 

So far the concrete formulae for the probabilities of neutrino oscillations by use 
of the mixing angle and the mass-squared difference have been given only for the 
simplified two generation scheme, for illustrative purpose. Needless to say, to make 
the analysis realistic, we have to derive formulae in the full three generation scheme, 
in terms of two mass-squared differences of neutrinos, 

Am 2 !, Am 2 !, (6.112) 
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and three mixing angles and one CP violating phase, 


$12, $23, $13) 

which appear in the MNS matrix as follows, 


<5, 


U 


C 12 C 13 


S 12 C 13 


-S 12 C23 - C\2S2ZSl3e 
S12S23 - Ci2C23Sl3e l<5 


is 


C12C23 - Sl2S23«13e 


iS 


~ C12S23 ~ Si2C23Sl3e 


( 1 

0 

0 \ 

/ C13 

0 

si 3 e~ 

0 

C23 

S23 1 

0 ■ 

1 

0 

v 0 — S23 
V23 Vl 3 Vi 2 > 

C23 / 

\ -si 3 e 85 

0 

Cl3 


-iS 


iS 


■sne 
S 23 C 13 
C 23 C 13 


(6.113) 



(6.114) 


where s,j and Cij represent sinfljj and cos 9^ , respectively. (This parameterization 
just corresponds to (4.183) for the KM matrix in the quark sector.) Such formulae, 
however, are complicated even for the case of vacuum neutrino oscillations, being 
described by the 6 parameters Arri^x, Amh, 612 , 023 , 613 ,5. Thus given data on the 
neutrino oscillations provide 5-dimensional hyper-surfaces in the parameter space, 
which of course cannot be shown graphically. Even if we get the allowed values of the 
6 parameters numerically, it may be practically impossible to get some meaningful 
physical information on the parameters from them. To be worth, we will not be sure 
with the obtained numerical values of the parameters, which mass-squared difference 
and mixing angle are really handling the oscillation, while each experimental data 
on the neutrino oscillation is usually provided in the form of the allowed region of the 
parameters assuming a simplified two generation scheme with generic parameters 
(9, Am 2 ). 

Fortunately, the hierarchical structure of two mass-squared differences of neu¬ 
trinos, suggested by the data of atmospheric and solar neutrinos experiments, 


A m\ x « Am^j, 


(6.115) 


enables us to reduce the problem of analyzing neutrino oscillations in the full three 
generation scheme to that in the effective two generation scheme. Namely, “re¬ 
duction formulae” are known to exist for each type of neutrino oscillations, which 
obviously make the analysis of the allowed region of theory parameters quite similar 
to those in the simplified two generation scheme, thus making the analysis quite 
transparent and physically meaningful. In addition, in this way we can clearly see 
which mixing angle and mass-squared difference out of the 6 parameters should 
be identified with those appearing in the experimental data. As we will briefly 
discuss in the next section, to explain the data on the atmospheric and solar 
neutrino oscillation two different scales of mass-squared difference are inevitable: 
Amltm — 2 x 10- 3 (eV 2 ) and A m 2 solar ~ 7 x 10 -5 {eV 2 ). Thus the identification 

A m 2 , = A m 2 atm ~ 2 x IQ- 3 (eV 2 ), 
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An-4 = Am 2 solar ~ 7 x 1(T 5 (eV 2 ), (6.116) 

should be made, which means the hierarchy (6.115) is a reasonably good assumption. 
(In fact numerical calculations show that the reduction formulae given below provide 
enough accurate oscillation probabilities.) 

Now we will discuss these reduction formulae for each of vacuum and resonant 
matter oscillations, successively below. 


6.5.1 Vacuum Neutrino Oscillation in the Three Generation Scheme 

Even under the hierarchical mass-squared difference (6.115), the reduction formulae 
of the oscillation probabilities will take different formulae, depending on which of 
A and Amfj is responsible for the oscillatory time-dependence of the oscillation 
probability. 


(A) Vacuum oscillation due to AmJj 

We first consider the vacuum neutrino oscillation, where larger mass-squared 
difference Am^ handles the oscillation. This, in turn means that Am^ is too 
small to participate to the oscillation. It will be relevant for the experimental 
situation satisfying, 


A m|i 

E 


L 


AmJt 

E 


L 


3.6 x 1(T 2 

( 2x 10 -3 e V 2 


/_A 

WxlO-^eV 2 > 


(l Gev) 




1 . 0 - 


(lGev) 


lOOfcm 


' lOOfcm 

) > 1 , 


) « 1 , 


(6.117) 


where L denotes the distance between the production and the detection points of 
neutrinos. The oscillation of atmospheric neutrinos and terrestrial “long-baseline” 
accelerator neutrino oscillation experiments may be classified into this category. It is 
worth while noticing that the oscillation length of atmospheric neutrino oscillation, 
implied by the zenith-angle dependence of its survival probability, is not the order 
of the size of the earth, but is of the order of a few hundred kilometers, roughly the 
distance from the Super Kamiokande detector till the top of the atmosphere at the 
Zenith-angle = f. Strictly, for the up-going neutrinos in the Super-Kamiokande 
detector, i.e. from the opposite side of the earth, the oscillation due to Am 2 2 
may not be negligible, though we do not consider the effect in this textbook. The 
wavelength is comparable to the baselines of K2K (KEK to Kamiokande) and other 
proposed long baseline experiments. 

Thus, ignoring the smaller mass-squared difference, Am 2 x = 0, the general 
oscillation formula (6.67) reduces into 


. Amoi 

P{v a -a vp) = 1 6 a0 - U P3 U: 3 ( 1 - e-*-^)! 2 , (6.118) 
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due to the unitarity J2i UpiU* { = 5 a p. Several survival and transition oscillation 
probabilities, relevant, e.g., for the atmospheric neutrinos, now takes simple concrete 
forms in terms of mixing angles 

, Ami, 


= l-4(l-|[/ M 3^)|f/ M 3|W(- 






t 31 

4 E 


t) 


1 - 4(1 - sin 2 ^23cos 2 0i 3 )sin 2 023cos 2 0 1 3 sin 2 ( 

1 - sin 2 2023 sin 2 (^^I<), 


4 E 


AE 

P(u e -> v e ) = 1 - 4(1 - |C/ e3 | 2 )|C/e 3 | 2 sin 2 ( 


Ami 


31 


AE 


t) 


1 - sin 2 2013 sin 2 (^~t) ~ 1, 
4 £j 


t) 

(6.119) 


( 6 . 120 ) 


v T ) = 4|(7 M3 | 2 |t/r3| 2 sin 2 (^^-t) 

2 ( A77I31 


= sin 2 20 23 cos 4 0 i 3 sin 2 ( 

- sin 2 20 23 sin 2 ( A 4 ^ -f), 


AE 


t) 


Ve) = A\U e3 \ 2 \U^\ 2 SlVL 2 {^^-t) 

- sin 2 20i 3 sin 2 023 sin~ 0, 

4 £j 


( 6 . 121 ) 


( 6 . 122 ) 


where the approximate formulae are for small 0i 3 , reported by e.g. CHOOZ exper¬ 
iment, 

sin 2 2013 < 0.2. (6.123) 


These formulae imply that, roughly speaking, v e does not oscillate and is decoupled 
from other neutrino species, and only v M v T oscillation is possible. This is what 
we expect for atmospheric neutrinos. We should also note that the formula for the 
v M -H- v T oscillation (6.121) is of exactly the same form as the transition probabil¬ 
ity in the simplified two generation scheme, i.e. as (6.69), though the replacement 
0 -t 023, Am 2 j -> Am 2 , has been made. Namely, in this case the mixing handling 
the oscillation is known to be 0 23 . Thus, the experimental data on the atmospheric 
neutrinos should provide constraints on the set of parameters (0 23 , Am 31 ), as we 
will see in the next section. Concerning the possible CP violation in neutrino os¬ 
cillations, the formulae given above are not suitable, since ignoring Am^ and 0i 3 
will erase the CP asymmetries, just as in the quark sector (see (9.57), (9.58)). 


(B) Vacuum oscillation due to A m\\ 

Next, we consider the vacuum neutrino oscillation, where the smaller mass- 
squared difference Am^ handles the oscillation. This, in turn, means that Am|j is 
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so large that the oscillation length due to A m 2 i is much shorter than the baseline L, 
i.e. the distance between the production and the detection points of the neutrinos. 
Thus the oscillatory factor due to Ashould be time-averaged, i.e. 


cos( 


2 E 


t) —t 0, 


(6.124) 


which means the interference of the matter waves of i / 3 with those of v\ and t / 2 can 
be ignored. This treatment will be relevant for the experimental situation satisfying, 


Arr4 

E 


Am|i L 


E 


= 7.2- 


/ 

17x10-' 


eV 2 


(smv) 100km 


-) - 1 , 


= 2.0 x 10 2 


( _ Am l i_i 

V 2x 10“ 3 eV 2 / 


(dW) WQkm 


-)»l. 


(6.125) 


This is exactly the case for the KamLAND experiment, recently started in Japan. 

Ignoring the interference of the matter waves of v 3 with those of Vi and v 2) the 
general oscillation formula (6.67) reduces into 


P(y a -> Vfi) = \S a p - U 03 u * a 3 - U 02 U* a2 {l - e-^ 4 )! 2 + |C/ /33 | 2 |C/a3| 2 . ( 6 . 126 ) 


In particular, the survival probabilities for a = (3 are expressed by the following 
reduction formula to the effective two generation scheme: 

P{v a -> u a ) = (1 — \U a3 \ 2 ) 2 P eff (u a —¥ v a ) + |f/ a3 | 4 , (6.127) 


where P e /f denotes the survival probability in the effective two generation scheme, 
which is obtained by the replacing the factor sin 2 0 in the formula for two generation 

\TJ 12 

scheme by a factor ■ Namely, 


Peff(y a -t Uq) = 1 - 4 


\U a 2 \ 2 (l-\U a 2 \ 2 -\U a3 \ 2 ) . ,.A m 2 . 


(1 - \ U a3 \ 2 ) 2 


sin 2 (- 


4 E 


t). 


(6.128) 


For instance, the concrete reduction formula in terms of mixing angle for the v e 
survival probability, which we denote by 5, is given as 


S = cos 4 0i 3 • S e ff( 0 i 2 , Am 2 !) + sin 4 0 13 , 

~ cos 20i 3 ■ S e ff( 6 i 2 , Am^) (6.129) 


where the survival probability of i/ e in the effective two generation system, denoted 
by S e ;f is described by a set of parameters (0i 2 , Am^) and reads as 

\ 2 

S e ff{ 0 i 2 , Am^j) = 1 - sin 2 2#i2 sin 2 (-^ i t). (6.130) 

The approximate formula in the second line of (6.129) is for small 0] 3 , (6.123). The 
formula (6.129) with (6.130) will be also applicable to the survival probability of FJ, 
relevant for the KamLAND experiment, as well, as long as the small matter effect 
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due to the neutrino interaction with the earth is negligible. Thus, in this case the 
oscillation is known to be handled by the mixing angle 0i 2 . 

(C) CP violation in neutrino oscillation 

As we will discuss in some detail for the quark sector in Chapter 9, to get CP 
violation we need the full interplay of three distinct generations of quark. In other 
words, if there is mass degeneracy in up- or down-type quark sector, or if some of 
generation mixing angles disappear, the CP violating observables will vanish (see 
(9.57) and (9.58)). The same arguments hold for the lepton sector (at least when 
neutrinos have Dirac masses). Thus the formulae given in (A) and (B) above, where 
either A m 21 or Am^ is responsible for the oscillations, are not applicable when we 
consider the CP violation in neutrino oscillation, and we should go back to the 
original formula (6.67). 

The clear indication of CP violation in the leptonic sector will be the asymme¬ 
try between the oscillation probabilities of neutrinos and the corresponding anti¬ 
neutrinos defined by 

A a 0 = P ( u <* -> V 0 ) ~ P{pct -» V 0 ). (6.131) 

The consequence of the CPT theorem shown in (6.72) (for the negligible matter 
effect) implies that the CP asymmetry is identical to the T asymmetry defined by 

A lcS = p i v <* -> v 0 ) - P{vp -> v a ). (6.132) 

Namely, = A' ! n p. This can be confirmed explicitly in the formula (6.67). 

Namely, the charge conjugation causes the replacement U a i —> U* { for every MNS 
matrix elements, which is equivalent to the replacement a (3 in (6.67). 

Then, a trivial relation A’ a p = —Aj ia tells us 

A c j = -A 0 a > (6.133) 

which in particular means = 0, namely 

P{v a -> v a ) = P(v a -> v a ). (6.134) 

Thus the CP violating effects do not appear in the survival probabilities and appear 
only in the transition probabilities. On the other hand, since the neutrino oscilla¬ 
tion we are considering here does not break lepton number, the unitarity, or the 
probability conservation read as 

^ P(Va -*V 0 )= ^2 P{Va V 0 ) = 1, (6.135) 

/3=e,/x.r /?=e,/z.T 

which means Yip = 0- Together with A%£ = 0 and A^p — —A^ we get a 
relation for q = e, Aff = A^f. Similarly, for a — fi we get A^f = Aff- We thus 
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obtain a remarkable relation as the specific feature of the three generation model, 

A ef = A° P = A re = A CP , (6.136) 

This is essentially the consequence of the fact that in the three generation model 
the CP violating phase is unique, i.e. 5 alone. 

Let us now derive the explicit form of A GP . From (6.67) we realize that for an 
arbitrary choice of the pair (a, ft), 

Am 2 

A ap = -+ p s) - P(p° -> up) = - 4 E • sin(-^t). 

i<j 

(6.137) 

Write Im (UaiU^UpjU*^) = J a p,ij ■ By definition we readily know J a p,ij = ~Jp,a,ij 
and J a p,ji = ~ Jap,ij • On the other hand, the orthogonality of the MNS matrix 
implies -4/3 ,ij = 0. We thus know for j = 2, e.g., J a p y 12 + Jap ,22 + J a p ,32 = 
J a p ,12 + Jap ,32 = 0 , Similarly, for j = 3 we get J a p ,\3 + To/ 3,23 = 0 . By use of 
the property J a p,ji = - Jap,ij , we thus conclude J a p ,12 = Jap, 2 3 = J a p, 3 i- Similar 
reasoning provides J etl ,ij = = J T e,ij■ In this way we have demonstrated that 

actually J a p,ij is unique, in spite of the possible choices of the combinations of 
a, /? and i,j. Hence we define J = J eMi 12 , which is the leptonic counterpart of the 
Jarlskog parameter (9.58) in the quark sector. In terms of the parameterization in 
(6.114), 


J = Ci2Si2C23S23Ci 3 Sl3S<5, j 


(6.138) 


where s$ = sin<S with 6 being the CP violating phase. Now setting as a = e, /3 = p 
in (6.137) and utilizing J eM , i2 = T e#i ,23 = Je»,3i, we readily get 


^ = -4J {5i „(^ i) + s i»(^l ( ) +S in(^ i)} , 


2 E 


2 E 


(6.139) 


where A m 2 j — m 2 — m 2 . The appearance of the factor J is quite reasonable, as 
it should be the unique re-phasing invariant measure of CP violation in the three 
generation model, as we will see in section 9.4. 

The above formula clearly shows that, as we anticipated from the introductory 
argument, the CP asymmetry A CP vanishes when (at least) one of the following 
situation is realized, though they may be unrealistic: (1) there is a degeneracy of 
neutrino masses, i.e. Am 2 i = 0 or Am 31 = 0, (2) one of the mixing angles 6ij or CP 
violating phase S vanishes, (3) time average of the neutrino oscillation probabilities, 

i.e. the average of the factor sin( t) is taken. Any experiment, which aims at 
the observation of the CP asymmetry should carefully avoid the situations (1) and 
(3). In particular, the experiment should be sensitive to the smaller mass-squared 
difference A rri^i ■ It should also be mentioned that the presence of the matter effect 
mimic the CP asymmetry, just because the matter effect for neutrinos and that for 
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anti-neutrinos are not the same. Thus how to avoid or extract the matter effect is 
another challenging issue. 


6.5.2 Resonant Matter Oscillations of Neutrinos in the Three Gen¬ 
eration Scheme 

Once the matter effect due to the weak interaction of neutrinos with the matter, e.g. 
inside the sun, is included, the time evolution equation is not analytically solvable, 
and the analysis of the oscillation probabilities in the full three generation scheme 
becomes quite cumbersome, in general. Still, however, some reduction formulae are 
possible if the matter effect is much smaller than AmJj, i.e. 


Am|i 

2 E ’ 


\PlG p N e 


Am^ 

2 E ’ 


(6.140) 


which is the case for solar neutrinos. (Concerning the neutrinos emitted from super¬ 
novae, the matter effect can easily exceed Am 2 1 /(2E) in the region near to the core, 
and two-step level crossings become possible. Though there exists an extension of 
the reduction formula, applicable for the case, it is out of scope of this textbook.) 
Here let us focus on the n e survival probability S. We will now see that, even in 
the presence of the matter effect, a similar reduction formula to the one given in 
(6.129) holds; 


S = cos 4 0 13 • S e ff{0i2, Am\ x \a ef j) + sin 4 #i 3 , (6.141) 

where a e // = cos 2 # 13 a(x) = cos 2 0 j3 \Z2GpN e (x) is the “effective” matter effect, 
reduced by the factor cos 2 #i 3 , in the effective two generation scheme (Lim, 1987; 
Smirnov, 1992; Shi and Schramm, 1992). 

Now let us derive this formula. We start with the time-evolution equation in 
the three generation scheme, (6.91). What we attempt is to separate the 3 x 
3 Hamiltonian matrix, though it cannot be exactly diagonalized because of the 
presence of the matter effect, into those for one and two neutrino systems, i.e. 
3 —> 1 + 2, by making some suitable approximation. Namely, we attempt to make 
the Hamiltonian of the form of block-diagonal, relying on the hierarchy (6.140). It is 
reasonable to expect that one mass eigenstate v 3 , having the dominant “energyh, i.e. 
the diagonal matrix element 2£ 31 , is decoupled form other two states, orthogonal 
to 1 / 3 , namely the linear combinations of v x and v^. Thus, in the base of mass- 
eigenstates, the Hamiltonian is approximated by H ~ diag(0,0, A ^ 31 ), roughly 
speaking. Instead of mass-eigenstates v\ and r/ 2 , however, it will be convenient to 
take their linear combination defined by u’ e = cos# 12^1 + sin#i 2 i' 2 , = —sin#i 2 r , i + 
cos#i 2 ^ 2 - The reason is, in this way, the 2x2 sub-matrix of the Hamiltonian in the 
space of p' e and v'^ has a similar form to that in the two generation scheme, since 
only 1 /' has the matter effect and the term of a(x) appears in the diagonal element 
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of the sub-matrix. Namely, we will work in the base 



COS0! 2^1 + Sin^i2^2 

sin^i 2 i / i + cos# 12 ^2 

^3 


(6.142) 


Recall that from (6.52), (6.114) the relation ip w = V 23 V 13 V 12 ipm holds between the 
column vectors of weak- and mass- eigenstates, ipw = (^ei, v^l, v t lY and ip m = 
(i'iL,^ 2 L,^ 3 L) t - Thus, the relation (V^ipw) = ki 3 (ki 2 ip m ) means £, = V^tpm = 
Ti 3 (^ 23 i’w)- More explicitly, 



cos#i 3 i/ e — sin#i 3 e %& i> T \ 
&m6i3e l5 ii e + cos#i3p r / 


(6.143) 


where 


COS#23^m - sin#23^r 
sin#23 Vfi + COS#23 V t 




(6.144) 


where the approximate relations are for # 2 3 — f, as suggested by the data on the 
atmospheric neutrino oscillation. Actually, as far as the survival probability of v e 
is concerned, we may set #23 = 0 without changing the answer, since both of initial 
and final states have nothing to do with i/ M , v T and the matter effect is invariant 
under the rotation of these states. As we expected, among 1 /' and z/ , only v‘ e has 
the i/ e component and, therefore, the matter effect. 

Now the time-evolution equation in the base of £ is written as 


Via J ^ 

Ai 2S? 2 + ac i3 A12S12C12 0 \ / 0 OasisCiaXl 

A12S12C12 A12C12 0 } + I 0 0 0 ] / 

0 0 A 13 + asl 3 J \as 13 C 13 O 0 / J 

(6.145) 

where A 12 = A 13 = and c,j = cos 6ij, Sij = sin dij. The second matrix 

in the right-hand-side of the above equation may be treated as a small perturbation 
under the hierarchy (6.140), and may be ignored. One may wonder why the small 
quantities in the 2 x 2 sub-matrix in the first matrix in the right-hand-side are kept. 
This is because at the leading 0-th order of perturbation, i.e. eliminating these 
small quantities as well, the Hamiltonian has degenerate eigenvalues 0 in the space 
of sub-matrix. Thus the two eigenstates in the sub-space is not fixed at the 0-th 
order, and the small perturbation should be kept only in the sub-space. We can 
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read off the factor cos 2 $ i 3 a(t) in the sub-matrix, which comes from the relation 
v' e — cos 0 i 3 ^ e - smdi 2 e~ lS D r . Namely, in this sub-space the matter effect is reduced 
to 


a e ff = cos 2 0 i 3 a(t). (6.146) 

Thus the time-evolution is described approximately by 

( A 12 sf 2 + acf 3 A 12 s 12 ci 2 0 \ 

i~j 7 — I A 12 S] 2 c 12 Ai 2 c 2 2 0 1 £. (6.147) 

\ 0 0 A 13 + asf 3 J 

Let us note that the separation 3 —> 1 + 2 has been realized, and the time-evolution 
in the subspace of is exactly the same as that of the two generation scheme, 

except for the replacement of the matter effect a —i a e f /. 

Since v e state corresponds to (ci 3 , 0 , Si 3 e t<5 )*, the survival probability S is given 
as 

5 = |c 2 3 Ae//K-^^) + 4e- i(Al3+as? 3) t | 2 
= cos 4 $13 • S , e//($i 2 ,Am| 1 ;a e //) + sin 4 $ 13 
~ cos 2$ 13 • S e ff(d 12 , Am^jae//), (6.148) 


which is nothing but the reduction formula (6.141). The approximate formula in the 
third line is for small $ 13 , (6.123). In the derivation we have ignored the interference 
of v 3 with other two states, as the oscillation due to A 13 is very rapid: e~* Al3t —1 0 . 
The effective survival probability S e ff ( 812 , Am^; a e ff) should be calculated by use 
of the 2x2 sub-matrix in the Hamiltonian of (6.147). Let us note that if we neglect 
the small $ 13 , S = S e ff and v e = v’ e . Namely, in this case the two generation 
treatment is exact and the solar neutrinos oscillate into v' = 


Ve ~ 


(6.149) 


and we expect roughly the same amount of and v T in the solar neutrinos. The 
error of the reduction formula (6.141) is anticipated to be handled by the relative 
strength of the ignored off-diagonal matrix element to the dominant matrix element, 



6.6 Atmospheric and Solar Neutrino Oscillations 

The smallness of neutrino masses and therefore small mass-squared differences 
means that the oscillation lengths of possible neutrino oscillations are sizable, and 
to test these oscillations are very challenging. Though a long distance L from the 
source to the detector of the neutrinos is desirable to detect such oscillations with 
long wave lengths, it in turn means the neutrino flux rapidly decreases as 1/L 2 . 
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Thus the neutrino source with large L and very intense flux should be ideal one to 
test the oscillation. We can find such ideal neutrinos sources with astrophysical or 
cosmological origin, i.e. atmospheric neutrinos and solar neutrinos. 

As we now very briefly discuss, both of these neutrinos have experimental puz¬ 
zles, called atmospheric and solar neutrino problems (anomalies): the observed 
event rates of these two-types of neutrinos are significantly smaller than the theo¬ 
retical predictions. We will see below that these reductions of the neutrino event 
rates are naturally accounted for by virtue of the atmospheric and solar neutrino 
oscillations. 


6.6.1 Atmospheric Neutrino Oscillation 

It has been known for long time that high energy particles (mainly protons) are 
flowing toward the earth, almost isotropically. These particles are called (primary) 
cosmic ray, and interacts with the atmosphere surrounding the earth via the strong 
interaction, to produce pions. The decay of the pions due to the weak interaction 
then copiously produces neutrinos, called “atmospheric neutrinos”, through the 
chain of decay processes: 

7 r + -4 p + + Vp, -> e + 4- v t + Up. (6.150) 


Let us note that the decay rate of 7 r + —> e + + v e is strongly suppressed compared 
with that of n + -a p + + shown above roughly by a factor (m e /m M ) 2 (see (2.49)). 
Though the absolute values of v t and fluxes have some uncertainties in their 
calculations, in the ratio of these fluxes (i/ M + u^)/(u e +P e ) denoting the flux, 
etc.), such uncertainty is considerably reduced. The above decay chain suggests 
that the ratio is roughly 2 . 

The atmospheric neutrino experiment by Super-Kamiokande collaboration claims 
that the “double ratioh of the observed (i/ M + P M )/(i/ e + v e ) to that of the predicted 
value remarkably deviates from 1 , 


{y “1“ Up)obs / {y e "b ^e)o6s 
[Vf! + Vfi)pred/(u e 4" &e)pred 


(6.151) 


This discrepancy is so-called atmospheric neutrino problem (anomaly). To be more 
precise, Super-Kamiokande data show that the i/ M capture rate is considerably 
smaller that the expectation, while that of u e is consistent with the prediction. 
Thus, probably the most natural explanation of this problem is to invoke to the 
atmospheric neutrino oscillation 


-> Mr, (6.152) 

while atmospheric u e do not experience any oscillation. This observation is consis¬ 
tent with the theoretical expectations of neutrino oscillation probabilities for small 
$13 given in (6.119) - (6.122). Comparing the survival probability P(m m -> m m ), 
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sub-GeV multi-GeV 



cos© cos© cos© cos© 


Fig. 6.8 The Zenith-angle ( 9 ) dependence of p-like and e-like neutrino events for the scattered 
particles in the energy ranges of sub-GeV and multi-GeV. The hatched regions are expected events 
for no neutrino oscillation, while the bold lines indicate the (best-fit) expected events assuming 
the neutrino oscillation Uy —> v T , From the result of Super-Kamiokande collaboration, (Super- 
Kamiokande collaboration, 1998). 


(6.119), with the data, Super-Kamiokande has derived the allowed parameter re¬ 
gion 


Amjj = (1.3 - 3.0) x 10 -3 (eV 2 ), (6.153) 

sin 2 2023 > 0.9. (6.154) 

To be interesting, the best fit value of sin 2 2023 is 1- Namely the maximal mixing 
has been realized in the lepton sector, in clear contrast to the case of quark sector ! 

To get the restrictive range of , the data on the Zenith-angle dependence is 
very helpful. Namely, the dependence of the atmospheric neutrino capture rate on 
the Zenith-angle 0 at Kamioka, shown in Fig. 6.8, is probably the most convincing 
direct evidence of neutrino oscillation. 

The figures tell us that the muon-like event rates at cos 0 = -1, i.e. 0 = tt, 
corresponding to the atmospheric neutrinos from the opposite side of the earth, 
indicates a clear deficit, while the event rate at 0 = 0 is quite consistent with the 
prediction. This may be attributed to the oscillation of atmospheric neutrinos 
which traverse long way toward the Super-Kamiokande detector. The remarkable 
decrease of the capture rate at 0 = f shows the wave length of the oscillation 
is of the order a few hundred kilometers. This is why the terrestrial test of the 
atmospheric neutrino oscillation at the long-baseline experiment, such as K2K, is 
possible. 
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6.6.2 Solar Neutrino Oscillation 

Another ideal source of neutrinos to perform the neutrino oscillation experiment 
sensitive to the very small neutrino mass-squared differences, is the Sun. As is 
well-known, the energy emitted by the Sun is due to the nuclear fusion processes at 
the center region of the Sun. A main series of reactions is called “p-p chainh: 

p + p -» D-t-e + +r/ e , < E v >= 0.26(MeV), 

e + 7 Be —> 7 Li + u e , E v — 0.86 {MeV), 

8 B -i 8 Be* + e+ + v e , < E u >= 7.2 (MeV), 

: , (6.155) 

where three reactions, which mainly contribute to the event rates at the on-going 
solar neutrino experiments, have been shown, together with the average energies < 
E > (a fixed energy for the second reaction concerning Be) of the emitted neutrinos. 
Actually, the second and the third reactions actually belong to different branches 
of the chain reactions. The neutrinos emitted by these three processes are called 
pp, Be and B neutrinos, respectively. The pp neutrino dominants the solar neutrino 
flux, though its average energy is relatively low and its detection is not easy, while 
Be and B neutrinos have higher energies and are detectable even in the experiments 
with relatively higher threshold energies, such as Super-Kamiokande, Homestake 
and SNO experiments, in the mines in Japan, U.S. and Canada, respectively . 

The chain reactions are summarized in the following net reaction of the nuclear 
fusion 


2e~ +4p 4 He + 2z/ e + 7 (26.73 MeV). (6.156) 

Though of course the nuclear fusion itself is due to the strong interaction, the 
presence of the weak interaction causes the emission of the solar neutrinos v e . 

The experiment for the detection of solar neutrinos, solar neutrino experiment, 
started by the pioneering work by R. Davis and collaborators, at Homestake mine, 
the “Cl experiment” by use of C 2 CI 4 . Now the data on the solar neutrino capture 
rates are available from four different types of solar neutrino experiments: 

1 . Cl experiment: 

v e + 37 Cl —> 37 Ar + e~, obs./exp. = 0.34 ± 0.03, 

(mainly) sensitive to B, Be neutrinos ( E t h = 0.81 MeV) 

2. Super-Kamiokande experiment: 

u + e~ -A v + e~, obs./exp. = 0.47 ±0.02, 
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sensitive to B neutrinos ( E t h = 5 MeV) 

3. Ga experiment (SAGE, GNO(GALLEX)) : 

v e + 71 Ga —> 71 Ge + e _ , obs./exp. = 0.55 ± 0.05, 

(mainly) sensitive to pp, Be, B neutrinos ( E t h = 0.23 MeV) 

4. SNO experiment : 

(charged current reaction) 

v e + D — >e~+p + p, obs./exp. = 0.35 ± 0.02, 

(neutral current reaction) 

i/ + D —>v + n + p, obs./exp. = 1.01 ±0.13, 

(sensitive to B neutrinos) (Eth — 6 . 8 MeV), (6.157) 

where Cl and Ga experiments are radio-chemical experiments, while Super- 
Kamiokande and SNO experiments utilize elastic scattering of neutrinos off 
electron (Super-Kamiokande) and the charged current interaction of v e and the 
neutral current interactions of all species of neutrinos, v e , and i/ T , with the 
deuteron D (SNO). In the above equations, “obs./exp.” means the ratio of the ob¬ 
served solar neutrino capture rate to the expected one, predicted by the standard 
solar model (Bahcall, et al., 2001). E t h denotes the threshold energy of each exper¬ 
iment. We realize that these experiments have all different E t h and are sensitive to 
different energy ranges. Hence we can get independent complementary data from 
these experiments, which enables us to restrict the allowed region of parameters of 
the theory, as we will below. We know from the above “obs./exp.” ratios that all 
experiments, except SNO neutral current experiment, have shown clear deficits of 
solar neutrinos. This anomaly is called “solar neutrino problem”. 

This solar neutrino problem find a natural solution in the neutrino oscilla¬ 
tion of solar v e . As the Cl and SNO charged current experiments claim that 
obs./exp. ~ 1/3, it is difficult to explain the deficits in the framework of the vacuum 
oscillation. Even though the (time-averaged) survival probability P(v e —> v e ) = 1/3 
is possible in he three scheme, it requires maximal mixings, i.e. the fine tuning of 
mixing angles (see (6.80) and the argument below that equation). Thus we should 
rely on the resonant matter oscillation of solar neutrinos in the three generation 
scheme, discussed in subsection 6.5.2. We have seen there that, solar v e , roughly 
speaking, oscillate into the even mixture of “active” (neutrino states with weak in¬ 
teraction) and i/ T (see (6.149)). Comparing the observed obs./exp. ratio with the 
reduction formula (6.141) or (6.148) of the survival probability, we get (with a little 
uncertainty of the small # 13 ) a MSW triangle, Fig. 6.7, in the parameter space of 
(log(sin 2 2 #i 2 /cos 2 ^ 12 ), log Am^) for each of experiment. Each experiment, having 
different deficit rates and energy sensitivities, will draw different MSW triangles. 
By taking the overlap of these triangles, we finally get a allowed region of the pa¬ 
rameters, called LMA (Large Mixing Angle)-MSW solution, as is seen in the right 
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Fig. 6.9 The allowed area in the parameter space of (sin 2 (20 12 ), Arn 2 2 ), where 0 12 and Amj 2 are 
simply written as 6 and Am 2 , imposed by the data of solar neutrino experiments. The hatched area 
is the allowed area at 95% C.L. obtained by the combined analysis on the neutrino flux observed 
in the Ga, Cl and Super-Kamiokande experiments. The area of the triangular shape, roughly 
speaking, is the exclusion area due to the day/night spectrum analysis by Super-Kamiokande. 
From the result of (Super-)Kamiokande collaboration, (Super-Kamiokande collaboration, 2001). 


upper area of Fig. 6.9. 

The best fit values are: 


A m 2 21 = 7.0 x 10“ 5 (eK 2 ), (6.158) 

tan 2 0 12 = 0.34. (6.159) 

Let us note the SNO neutral current result obs./exp. = 1.01 ± 0.13 in (6.157) 
is consistent with the neutrino oscillation (6.149), since the i/ M and v T are active 
states and contribute to the capture rates of solar neutrinos, just as v e does. Namely 
only in the SNO neutral current experiment, the capture rate is expected to be the 
same as the prediction of the standard solar model, which is exactly what has been 
confirmed by the experiment. 

When we discuss the deficit of the solar neutrinos, of course we have to know 
the prediction of the standard solar model. There exist, however, a way to confirm 
the neutrino oscillation, irrespectively of the solar model, by use of the combined 
results of Super-Kamiokande (Super-Kamiokande collaboration, 2001) and SNO 
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charged current reaction (SNO collaboration, 2001)! This should be a direct and 
convincing proof of the solar neutrino oscillation. The clue is in the fact that in 
the elastic scattering of solar neutrinos off electron, not only u e but also and 
v T contribute to the event rate, though the ratio R„ of scattering cross section of 
v^ and v T to that of v e is roughly R a = 1/6 to 1/7, because of the presence of 
the additional charged current process Fig. 6.4(b) in the case of the v e scattering. 
On the other hand, in the SNO charged current reaction, only v e can contribute to 
the process, as the charged current processes of other neutrinos require higher than 
available neutrino energies. Thus if the B neutrino flux predicted by the standard 
solar model is denoted by </>ggM> t ^ ie “effective” fluxes at these experiments, denoted 
by 4 >SK an d <t>cmo(CC) should be given as 

^SK ~ ^SSM x i s + (1 - S)R<?} (6.160) 

^SNO^^) = ^SSM x S (6.161) 

where S is the survival probability (6.141). (For illustrative purpose, we have ig¬ 
nored the dependence of S and R a on the solar neutrino energy.) It is quite impor¬ 
tant to note that the observed value <j> gj^ ~ 2.4 x 10 6 (/cm 2 /s) is certainly larger 
than (/gjyo(CC) — 1.8 x 10 6 (/ cm 2 /s ). This means S < 1 and that among the 
solar neutrinos there surely exist the active neutrinos and v T , which strongly 
suggests the solar neutrino oscillation v e -A i/ M , v T . (It also implies that the so¬ 
lar neutrino oscillation into a sterile state (states without weak interaction) is not 
favored.) Furthermore, equating (6.160) and (6.161) with the observed values, we 
get 0gsM = 5-4 x 10 6 (/cm 2 /s) and S = 0.33, for R a = 1/6. The obtained </>gSM 
is quite consistent with the prediction of Bahcall et al., (5 ± 1) x 10 6 (/cm 2 /s). 

The most recent remarkable experimental achievement is the confirmation of 
the solar neutrino oscillation in quite different circumstance, i.e. the terrestrial 
oscillation experiment of v e produced by nuclear reactors in Japan, i.e. KamLAND. 
As we have seen in the subsection 6.5.1, the experimental setting is such that the 
oscillation is due to the smaller mass-squared difference Am^ and , just as 
in the solar neutrino oscillation, though the matter effect may be safely ignored, 
even though not negligible. Thus the formulae (6.129) and (6.130), similar to the 
one in the solar neutrino oscillation (6.141), gives a prediction on the deficit of D e 
at the detector. The predicted deficit rate for the MSW-LMA solution, (6.158) 
and (6.159), is quite consistent with the observation, which started very recently 
(KamLAND collaboration, 2003). 


Problems 

6.1 Show that the relation (6.17) holds, to verify that a free Weyl fermion and a 
free Majorana fermion are equivalent, as far as they are massless. 
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6.2 Verify the relation (6.41) when (6.40) holds. 

6.3 Show that the coefficient of the (irrelevant) operator (6.48) provides a Majorana 
mass for vl when the Higgs doublet <fi develops its vacuum expectation value. 

6.4 Verify the relation (6.66). 

6.5 Verify that the adiabaticity condition (6.101) at the resonance point is equiva¬ 
lent to the condition (6.103). 

6.6 State the intuitive physical meaning of (6.108). 





Chapter 7 

SUPERSYMMETRY 


7.1 Supersymmetry and The Hierarchy Problem 

Supersymmetry (SUSY for short) is a symmetry which connects particles and fields 
with different spin-statistics, i.e. bosons and fermions. When a Lagrangian density 
is invariant under a properly chosen transformation which connects particles whose 
spins differ by 1/2, the theory is said to be supersymmetric. The supersymmetry 
is independent of any internal symmetry such as gauge symmetry, and therefore 
connects a pair of particles, “superpartners” of each other, with different spins 
but the same quantum numbers such as electric charge, weak isospin, color etc. 
As we will see later the generator of the supersymmetry transformation is known 
to commute with the generator of space-time translation P fl (4-momentum), and 
therefore with P%. We thus learn that the superpartners have the same mass. 

If we wish to apply the supersymmetry to the description of quarks and leptons, 
the supersymmetry clearly should be broken spontaneously or explicitly, as we have 
not seen any spin = 0 scalar particles with the same charge and the mass as those of 
quarks and leptons. Though it is not hard to break supersymmetry spontaneously, it 
easily happens that some of superpartners of light quarks or leptons are lighter than 
these particles, thus contradicting with the reality. In the Lagrangian, to put by 
hand the terms which break supersymmetry explicitly, such as the additional mass- 
squared term for the scalar electron, is also possible and an easy way. However, 
without any guiding principle, such arbitrarily introduced explicit SUSY breaking 
terms easily lead to an unacceptable prediction, such as too large flavor changing 
neutral currents. The detailed discussion on the mechanism of SUSY breaking is 
beyond the scope of this textbook, though we briefly discuss the mechanism of 
spontaneous SUSY breaking in a later section, and we just consider here how large 
the SUSY breaking mass scale can be. The scale will be fixed so that the SUSY 
works as a solution of the so-called hierarchy problem. 

What is the hierarchy problem? Though it has been perfectly explaining all 
phenomena (except the recently reported neutrino oscillation), the standard model 
of elementary particles is widely believed not to be the final theory; in particular, 
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Fig. 7.1 


it has too many free parameters and the gauge couplings are not really unified as 
the gauge group is not a simple group. We thus expect that the standard model is 
valid up to some energy (mass) scale A (a physical cutoff of the standard model), 
and at A the theory is replaced by more fundamental new theory incorporating the 
standard model, i.e. “beyond the standard model” or “new physics”. The standard 
model is thus regarded as the “low” energy effective theory with the cutoff A. 
The cutoff A is Mgut if the new physics is Grand Unified Theories (GUT), with 
e.g. Mgut ~ 10 15 GeV for SU(5 ) GUT, and M p i ~ 10 19 GeV (Planck mass) if 
the new physics is a unified theory with gravity. These mass scales, Mgut, M p i 
are much higher than the weak scale Mw, the typical mass scale of the standard 
model. The hierarchy problem is the problem of how to maintain the hierarchy 
of these mass scales, i.e. Mw <C Mgut,M p i. The problem is the most serious in 
the sector of scalar particle such as the Higgs particle, as the Higgs mass-squared 
get a huge quantum correction proportional to Mq UT or M 2 ,. This large quantum 
correction manifest itself as a “quadratic divergence” A 2 if we calculate the quantum 
correction to the Higgs mass-squared in the effective theory, i.e. in the standard 
model: the problem of “quadratic divergence” (See Fig. 7.1(a)). Thus it is hard to 
maintain the hierarchy naturally. Actually, it is possible to adjust the bare mass 
of the Higgs, so that the renormalized Higgs mass remain to be the weak scale. It, 
however, will require a fine tuning of the bare parameter at the precision roughly 
of (Mw / Mgut) 2 ~ 10 —26 , for instance, which we regard as unnatural. 

Supersymmetry had been a subject of the theoretical research for rather long 
time, but its application to the particle physics was strongly motivated by the desire 
to solve the hierarchy problem. In fact, supersymmetric SU (5) GUT was proposed 
by Sakai and Dimopoulos-Georgi (Sakai, 1981; Dimopoulos-Georgi, 1981) based on 
the motivation. Let us note that in the supersymmetric theory coupling constants 
of a particle and its superpartners are identical, and also that the Feynman rule 
provides an additional negative sign for the diagram with a loop of fermion. Thus 
the quadratic divergences from the two diagrams Fig. 7.1(a) and (b) are known 
to cancel with each other as long as the supersymmetry is exact. In this way, the 






Two-component Notation 


149 


hierarchy problem of the quadratic divergence is solved. As the matter of fact, 
the supersymmetry should be broken to the extent of “SUSY breaking mass scale” 
Msusy ■ We, however, are still free from the quadratic divergence, though we do 
have an logarithmic divergence, ~ aMg USY log We regard the logarithmic 

correction as the order of aMg USY ■ Then the SUSY breaking scale should not be too 
high, since otherwise the hierarchy problem will arise again, i.e. aMg USY < 

This leads to a condition Msusy < 1 (TeV), roughly speaking. 

It should be also stressed that another very important motivation to study the 
SUSY is that it almost automatically accommodates gravity. This is because the 
commutation relation of two infinitesimal SUSY transformations is just space-time 
translation. Thus, if SUSY is promoted to a local symmetry the resultant theory 
should be invariant under a local transformations of space-time coordinates, i.e. 
under the general coordinate transformations, thus leading to a supersymmetric 
gravity theory, supergravity. 

Discussions in this chapter is, in many respects, based upon the classic textbook 
on the supersymmetry and supergravity by Wess and Bagger (Wess and Bagger, 
1992). 


7.2 Two-component Notation 

Supersymmetry (SUSY) is in some sense a symmetry concerning (external) space- 
time coordinates, instead of internal symmetries; the fact that the commutation 
relation of SUSY transformations is space-time translation is the manifestation. 
Actually, as we will see later, SUSY transformation can be understood as a trans¬ 
lation in a Grassmannian space, “superspace”. Just as the Lorentz transformation, 
the SUSY transformation preserves (a little modified) chirality. Thus the most fun¬ 
damental representation of SUSY has a definite chirality, and the usage of chiral 
(Weyl) fermion, i.e. the usage of the two-component notation is useful. We thus first 
summarize the notation and some useful relations in the two-component notation. 

Weyl spinor, i/'a.l = 1 = * 2 75 V*. behave as irreducible representation of Lorentz 
group, since [S M1/ , 75] = 0, with being the generator of the Lorentz group. 
A Weyl spinor has a half degrees of freedom compared with a Dirac spinor and 
is expected to be described by a two-component spinor, in a suitable base of 7 - 
matrices. To see this explicitly, we employ the Weyl base of 7 -matrices, where 75 
is a diagonal matrix, i.e. the same base as shown in (6.1)-(6.4); 


7 m 

\ ° 

0 it> 

II 

(7.1) 

or" 


(7.2) 

o' 1 

= (-1, -<r»), 

(7.3) 
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75 = i7°7 1 7 2 7 3 


I 

0 



(7.4) 


where I is a 2 x 2 unit matrix and cr* (i = 1,2,3) are Pauli matrices. Then the Weyl 
fermions are described by 2-component complex spinors as we expected: 


Va 

0 


0 


, (dj h: 1 j 2). 


IpR = 

IpL = 

\ « / 

The Lorentz generators have the form of block diagonal 

(R # v), 


= - 
2 


a» v 0 
0 


(7.5) 

(7.6) 


(7.7) 

(7.8) 


which clearly means that the chirality is preserved under the Lorentz transforma¬ 
tion. It will be necessary to comment on the raising and lowering of the 2-component 
spinor indices a or d and the meaning of the dot. We first note that the right-handed 
Weyl spinor, r} a , is transformed by a Lorentz transformation as r)' a = M a 0 r} 0 , where 
the 2x2 matrix 


M = exp {ie^ v ), (7.9) 

with 6 transformation parameters e M „ (e„ M = — e p „). It should be noted that the M 
is an element of SL(2,C). In fact, since the generators are traceless detM = 
1 and also the 6 real degrees of freedom of e M „ just coincide with the degree of 
SL(2,C), 2 x 2 2 — 2. We can easily show that in SL(2, C) an invariant inner- 
product of two fundamental representations, i.e. doublets T) a and Xa, is given by 
e a ' i ri a xi) (c a/3 : 2x2 Levi-Civita tensor, e 12 = —e 21 = 1), by use of the property 
detM = 1. Thus the invariant product can be written as 

Va x a , (7.10) 

once the raising of the lower index is defined as 

X“ = (7.11) 

In other words, the Levi-Civita tensor e a/3 behaves as the metric tensor in the 2- 
component spinor space. Hereafter we will use the abbreviation, rjx = R a Xa = 
~VaX a = X a Va = X’h W = VaX a = XV, for convenience. 

Noting an identity <7 M " = —eo- M "*e, we find that the matrix of Lorentz transfor¬ 
mation for a left-handed Weyl spinor should have the following indices, 

(M*)4 (7-12) 
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where the dot of the index denotes the complex conjugation of the quantity. To 
be consistent, the left-handed 2 -component spinor should have the following upper 
dotted index 


IpL = 



(7.13) 


We may also understand the origin of the dot as the result of charge-conjugation. 
Namely, the charge-conjugation interchanges the chirality of Weyl spinors, e.g. 

= 0 , while the charge conjugation always has the action of complex 
conjugation. Thus, when the right-handed spinor has an undotted index the left- 
handed spinor should have a dotted index. More explicitly, the charge-conjugation 
of a Dirac-spinor tpo = ipR + ipL is given as, 


(<M C 


c v4> = rfipb = h 2 ^ ^ ^ 

/ 0 e P<* \ ( Va \ _ ( & 

0 ) ( r J V 1)13 


where q^ = (q a )* etc. This clearly shows that when £ = q the spinor 


(7.14) 


V'M = ( ^ , (7.15) 

is self-conjugate under the charge-conjugation, 

=1>m- (7-16) 


This specific “real” spinor is called Majorana spinor, and the particle described by 
the spinor is called Majorana particle. 

Once the assignment of the spinor indices of Weyl spinors is fixed, the assignment 
of the indices of various 2 x 2 matrices are determined as, 

(^)ad,K)“ a ,(Oa 01 etc. (7.17) 

The Lagrangians of renormalizable theories have bi-linear terms of spinors. Weyl 
spinors ipR and ipi behave as 2 and 2 representations of SL(2, C), or equivalently 
(|,0) and (0, |) representations of the Lorentz group, where \ and 0 are the “spins” 
of two SU ( 2 )-like groups, contained in the Lorentz group. We then ask how bi-linear 
forms of Weyl fermions behave under the Lorentz group. The bi-linear form of two 
Weyl spinors with the same chirality decomposes into (1,0) -f (1,0) or (0,1) + (1,0) 
representations, as is known from the recombination of the spin: | x 1 = 0 + 1 . 
More concretely, the singlet (0,0) and the triplet (1,0) representations formed by 
the product of two, e.g., right-handed spinors, qi and %, are written as 

t a0 qi a qn} = qiaVS - -Vim, {o^ v ) al3 qi a q-z0. (7.18) 
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The reader can easily check that {a ,lv ) a ^ is symmetric under the exchange of the 
spinor indices a and /?, the property necessary for the triplet. This statement, 
however, is a little puzzling, as the number of possible combinations of fi and v in 
the is 6, instead of 3. Actually, we find that not all of er M " are independent; for 
instance a 01 and a 23 are essentially identical matrices. (In fact, if our space-time 
were Euclidean, this tensor satisfies the self-dual condition cr^" = I e # “" tA <7 KA , while 
satisfies the anti-self-dual condition.) Similarly, the product of two left-handed 
spinors decomposes into (0,1) + (0,0) by the contractions with (<5 r/t ") (i/ g and e Aji . In 
the four component-notation, where the right-handed spinors are written as ipi = 
{Via,0y and V ; 2 = (h 2 a,0) f , the singlet and the triplet discussed above correspond 
to the following bi-linear forms, 'tp\Cip2 and Then what corresponds to 

the Lorentz vector V , i 7 A1 V , 2 ? (Note that tp[ Cip2 identically vanishes.) This time 
the bi-linear form is made by the mixture of the dotted and undotted indices, i.e. 
~Via {v* l ) 0 ‘ 0 ‘ T l 2 cf Namely, (|,0) x (0, |) = (|, |) behaves as a Lorentz vector with 
4 components. 

We display in the Problem 7.1 some of formulae in the 2-component notation, 
which are frequently used and useful in the discussion below. 


7.3 SUSY Algebra and SUSY Group 

As already stated, supersymmetry (SUSY) is the symmetry under the exchange of 
bosons and fermions. Remarkably, it is not only a possible symmetry, but also is a 
unique symmetry, consistent with relativistic quantum field theory, besides internal 
symmetries such as gauge symmetry. 

It had been known that generally the Lie algebra of possible symmetries of the S- 
matrix is restricted to the one composed of P M , M m „, and Lorentz scalar generators 
which are the generators of 


space-time translation, 

Lorentz transformation, 
gauge transformation, 

respectively. Supersymmetry, therefore, seems to be not allowed as a symmetry. 
This no-go theorem (Coleman and Mandula, 1967), however, has been proved to be 
evaded, once the Lie algebra is extended so that it may contain anti-commutation 
relations as well as commutation relations, i.e. to an graded Lie algebra (Haag, 
Lopuszanski and Sohnius, 1975): the algebra now takes a form 

Q,Q’ = X, [X,X'] = X", [Q,X] = Q",..., (7.19) 

where Q etc. and X etc. are, say, fermionic and bosonic generators. Since SUSY 
transformation connects fields whose spins differ by 1/2, the generators of SUSY 
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naturally carry spin 1/2, and therefore satisfy the anti-commutation relation due 
to the Fermi-statistics, just as Q etc. do in (7.19). 

Through such rigorous argument, it turns out that the SUSY algebra is given 
as 

{Qa, Qa) = -2K)«d Pp, (7-20) 

{Qa, Qp} ~ { Qa ) Qp} — 0, (7-21) 

[Pp,Q a \ = [Pp,Qa]= 0, (7-22) 

[Pp,Pv\ = 0, (7.23) 

where Q a and Q„ are complex conjugate, or hermitian conjugate in the sense of 
operators, of each other, and are Grassmannian operators, as is seen typically in 
(7.21). One may wonder why the anti-commutator in (7.21) should vanish, while 
that in (7.20) is non-vanishing. Just as the product of two doublets \ x | decomposes 
into a triplet and a singlet 1 + 0 in the calculus of spin, if {Q a , Qp} were non¬ 
vanishing, it generally should have contained two parts C{ a ,/ 3 } and C[ a ^, i.e. the 
parts symmetric and anti-symmetric under the exchange of a and /3, belonging 
to (1,0) and (0,0) representations of the Lorentz group, respectively. The anti¬ 
symmetric part, however, trivially vanishes, since { Q a , Qp} is symmetric under 
the exchange. Let us note that this situation changes in the case of extended SUSY 
with N (N > 1) kinds of SUSY transformations generated by Q l a (and Qpp), 
where a “central charge” term of the form e a p c^' J 1, symmetric under the exchange 
(a, i) «-» is allowed to exist. In the case of simple SUSY N = 1, howver, 

the only possibility is the symmetric piece C{ a< py, proportional to (<r ^ v ) a p 
A rigorous argument leads to the conclusion that this term is not allowed to exist, 
being consistent with the SUSY algebra [P M ,<? a ] = 0. 

We thus know that the commutator of SUSY transformations provides a space- 
time translation, i.e. 

[(e a Q a ), (£*0*)] = [(eQ), (eQ)] = -2( e “K) aA e“) P M , (7.24) 

where e“, e“ are Grassmannian transformation parameters for SUSY and = 
e Q (ff M ) QC ve“ denotes the amount of the infinitesimal space-time translation. There¬ 
fore, it is a rather trivial fact that once SUSY is promoted to a local symmetry the 
resultant theory inevitably has a local translational invariance, i.e. the invariance 
under the general coordinate transformations, thus leading to a SUSY theory in¬ 
cluding gravity, i.e. supergravity. In this book, we will restrict our argument to the 
global (rigid) supersymmetry where the transformation parameters are space-time 
independent. 

Having the SUSY algebra, we may define a SUSY group, which is a set of trans¬ 
formations generated by the SUSY algebra, e I (» l ‘p, 1 +fQ+<Q) ^ w h ere the generators 
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and Q are the elements of SUSY algebra and <T, e, l are the corresponding 
transformation parameters. We need representations of SUSY algebra, on which 
the transformations of SUSY group should act, so that we can assign our quarks 
and leptons and gauge bosons to some of the representations. In the SU(2) gauge 
theory, for instance, the representations are concretely expressed as multiplets, such 
as doublet, triplet, etc., on which the group elements act in the form of matrices. 
Similarly, the representations of SUSY group can be expressed as “SUSY multi¬ 
plets”. As the SUSY transformation connects fields whose spins differ by 1/2, it 
is expected that each multiplet is made of “super-partners” of each other, such as 
a (spin 0) scalar and a (spin 1/2) fermion. SUSY generators Q and Q commute 
with p 2 . Thus all members of a SUSY multiplet on their mass-shell should have a 
unique mass, and the SUSY multiplet can be classified by the mass M ( p 2 = M 2 ), 
Let us first consider massless (M = 0) SUSY multiplets. The massless multiplets 
we discuss in this book are 

“chiral multiplet” : (A, iPl,r) (a = 0, s = 1/2) (7.25) 

“vector multiplet” : (U M , A) (s = 1, s = 1/2), (7.26) 

where s denotes the spin. A is a complex scalar field and A should be a Majorana 
fermion, corresponding to the real vector field V M . The chirality of the Weyl fermion 
of the chiral multiplet can be either L or R. We will be able to assign our quarks 
and leptons and Higgs to the component fields of the chiral multiplets, together with 
their super-partners. Gauge bosons, such as photon, are assigned to V fl together 
with their superpartners, such as “photino”. The important feature of the SUSY 
multiplets is that the (real) degrees of freedom of boson and fermion fields just 
coincide. This, however, seems to be not the case in the above multiplets; for 
instance, in the chiral multiplet (A, iPl,r), the complex scalar clearly has 2 degrees, 
while the Weyl fermion ipL seems to have 4 real degrees of freedom, at the first sight. 
This superficial puzzle may be resolved if we realize that the on-shell fermionic state 
has just the half degrees of freedom, due to the equation of motion. For instance a 
massless Weyl fermion ipR = (r] a , 0) 4 with 4-momentum = (p, 0,0, p) satisfies a 
Dirac (Weyl) equation in the momentum space 

= 0 »{~o P o)UMo)' <7 - 27) 

which shows that the tji disappears and only rji remains. This argument, in turn, 
suggests that the SUSY algebra is closed only for on-shell states, or only by use of 
the equations of motion. In order to make the SUSY algebra close even for off-shell 
states, we therefore need to introduce additional bosonic states. Such additional 
states, i.e. 1 complex scalar field in the case of chiral multiplet and 1 real scalar 
field in the case of vector multiplet, should not have physical degrees of freedom, 
since they disappear once the equations of motion are applied. Thus these fields 
are called “auxiliary fields”. 
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Concerning the SUSY multiplets of (on-shell) massive states with M / 0, the 
situation is rather different from the massless case. A general argument (see for 
instance (Wess and Bagger, 1992)) shows that the multiplets are generally “larger” 
than those in the massless case. Let us take the example of a massive SUSY mul- 
tiplet with the highest spin 1. This multiplet has one s = 1, two s = 1/2 and one 
s = 0 component fields, though the SUSY transformation changes the spin of fields 
only by 1/2. This may not be so surprising if we recall the Higgs mechanism in 
ordinary gauge theories. For a gauge bosons to be massive, it needs to absorb a 
scalar field. Thus for a vector SUSY multiplet to be massive, it needs to absorb 
a chiral multiplet, which has inevitably a scalar component field. One remarkable 
feature of the SUSY theory is that, at the same time as the gauge boson becomes 
massive absorbing the scalar, gauge fermion (a superpartner of gauge boson, “gaug- 
ino”) absorbs the chiral fermion of chiral SUSY multiplet to form a massive Dirac 
fermion (super-Higgs mechanism). For instance, suppose that the vector multiplet 
(Vfi, A) is made of the photon and “photino”, and the scalar component A of the 
chiral multiplet (A, 4>l,r) is a charged scalar. Assuming that the real part of the 
A develops the vacuum expectation value (VEV) by a suitable scalar potential, the 
photon will absorb the imaginary part by the Higgs mechanism. Thus, we have a 
massive photon with 3 degrees of freedom and a massive scalar of 1 degree, i.e. the 
real part. As for the fermionic part, due to the super-Higgs mechanism, the photino 
A and ip form a massive Dirac particle, which should be the super-partner of the 
massive photon. 


7.4 Superfield Formulation 

We have formulated SUSY algebra and its representation, SUSY multiplets. The 
next task is to find the rule of SUSY transformation among the component fields 
of a SUSY multiplet, which is non-trivial. A powerful technique for such purpose 
has been invented, which makes the formulation of SUSY transformation and the 
construction of supersymmetric theories, i.e. the theories invariant under the SUSY 
transformation, almost automatic. 

The formulation is based on the notion of “superfields”, which are the fields 
defined on “superspace”. Before going into the detail, let us recall that the gen¬ 
erator of space-time translation, acts on an ordinary field </>(x M ) as a differ¬ 
ential operator p^cpix* 1 ) = Since (SUSY) 2 ~ space-time translation, 

roughly speaking, and a Lorentz vector can be expressed as a bi-linear form of a 
spinor, = e a {a^) a af a as suggested by (7.24), it will be natural to expect that 
the generators of SUSY transformations Q and Q act as a sort of translations 
in a “fermionic space”, with anti-commuting Grassmannian coordinates 6 a , 6^ 
({#<*,$/?} = {#«>#<*} = {6dnQa} = 0). The space with orninary space-time co¬ 
ordinate and the Grassmannian coordinates, (xA, 9 a , £?<*), is called superspace, and 
a field defined on the superspace is called superfield. Actually it turns out that the 
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SUSY generator is not a simple translation of 0 and 9 coordinates, but contains 
a translation of as well. Then a natural question is what is the differential 
operator of SUSY generator, corresponding to = i gf^-? 

Sometimes it is useful to regard a space (or space-time) as a “group manifold”, 
which is a space, whose point can be identified with an element of the group. For 
instance, a point on the Earth, i.e. a 2-dimesional sphere 5 2 , may be obtained as 
the result of the action of 3-dimensional rotation group 50(3) of the point on the 
north-pole. Hence the point can be identified with the element of 50(3) group. 
This identification is not unique, as the rotation about the z-axis, 50(2), does not 
move the north pole and can be freely added; 5 2 may be regarded as a group man¬ 
ifold of 50(3)/50(2), whose degree is 3 — 1 = 2, the same as that of 5 2 . More 
explicitly, a point P( 8 ,4>) with polar coordinate ( 8 , <j>) and Cartesian Coordinates 
(sin#cos</>, sin0sin</>, cos#) on a unit 5 2 embedded in a 3-dimensional space is iden¬ 
tified with an element of 50(3), G(9,(j),a) = R 3 {—<j))R 2 {— 9 )R 3 {a) with (see also 
(4.179)) 


( Ci 0 -Si \ f Ci Si 0 \ 

0 10, R'i(di) = -Si Ci 0 , (7.28) 

Si 0 ^ J \ 0 0 1 / 

where Ci = cos 9i and Sj = sin 8 i . In fact, 

/ sin0cos</> \ / 0 \ / 0 \ 

f sin0sin</> j = R 3 (-<t>)R 2 (- 8 )R 3 (a) f 0 j = R 3 {-<t>)R 2 (-0) ( 0 1 , (7.29) 

where we find that the 50(2) rotation with angle a is irrelevant. Let Q(9', <p') be a 
point on the 5 2 , which is obtained by the action of a 50(3) rotation G(/t, A, /?) on 
the point P(8, <f>). Then how can we express the transformation of space coordinates 
(9, <fi) —» (< y,<f>') by use of /c, A,/? ? We note 


( sin#'cos<^' \ / 0 \ / smdcos(p \ 

sin^'sin^' j =G( 6 ',(j> 1 , 7 ) I 0 j = G{k, A, 5) I sindsiiuf) J 
cosd 1 ) \ 1 / V COS0 / 


= G{k, \, (3){G{8, (j),a) 


°\ 

0 j} = {G(K,A,/?)G(M,a)} 



(7.30) 


This means the transformation of the space coordinates is completely determined 
by the multiplication rule of group elements: 0 ( 0 ', </>', 7 ) 

= G(k, A, 5)0(0, (j>, a). Such determined transformation automatically satisfies all 
properties of 50(3) group, by construction. 

Similarly, we can establish the transformation of superspace coordinates ( x , 0,0). 
Let O(a: M ,0,0) = e l ( x ^p^+ e Q+ 0 Q) be an element of SUSY group and identify it with 
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a coordinate (x M , 0, 9). By use of the SUSY algebra (7.20)-(7.23) and the well- 
known relation e A e B = e A+B+±[A,B]+ ••• we g et ( s j nce higher commutators do not 
contribute) 

G(0, e, e) ■ G(x^ ,0,6) = G(x» + iOa^e - ie^O, 0 + e,6 + e). (7.31) 

^From the identification we readily get a transformation property of the superspace 
coordinates due to the action of G(0,e,e), 

(a^M) -> (x* 1 + i0a>*e-iea tt 6,6 + e,6 + e), (7.32) 

where Oa^I = 6 a {o^) ar - x f a , etc. Thus we can conclude that the action of the SUSY 
transformation G(0, e, e) on a superfield is equivalent to the application of e*( c, 3+ £C 2) 
with the following differential operators, corresponding to the (i times) SUSY gen¬ 
erators, 

p 

% = (7-33) 

p 

iQi = + iO a ^) a6t d^. (7.34) 

do 

The nice thing to use the group-manifold approach is that such constructed differ¬ 
ential operators automatically satisfies the SUSY algebra, just because it is based 
on the group element itself. For instance if G 1 G 2 = G 3 holds among the elements 
of SUSY group, multiplying G(x IL ,0,0) from the right of both sides and identifying 
G{x^,6,0) as a superfield <j> and Gi,G 2 ,G 3 with the corresponding group elements 
with differential operators, we immediately get GiG2 <1> = G^cj). In fact, we can 
directly confirm that the differential operators (7.33) and (7.34) satisfy 

{Qa,Q cJ = -2i((T'%4 0 M , {QonQfi} = {Qa,Q^} - 0 , (7.35) 

to be consistent with (7.20) and (7.21). 

We will now see how the component fields of SUSY multiplets are contained 
in the superfields. Superfields containing the irreducible representation of SUSY 
algebra, i.e. (massless) chiral multiplet and the (massless) vector' multiplet, are 
called chiral superfield and vector superfield. Before going to the discussion of 
each superfield, we first consider the most general complex superfield, <j>(x,6,6) 
(the Lorentz and spinor indices of the coordinates have been suppressed). The 
important feature of the superfied is that, when it is Taylor-expanded with respect 
to Grassmannian coordinates 0, 0, it contains only a finite number of terms, whose 
coefficients are functions solely of x, i.e. ordinary fields. This termination of the 
expansion is due to the anti-commuting property of the Grassmannian coordinates; 
e.g. 6 a 6p6 1 = 0 as the spinor indices a, [5 ,7 take only 1 or 2. Thus the general 
superfield can be expanded as 

(j>{x,6,6) = C(x) + 0x(x) + Ox'(x) + 00M(x) + 00N(x) 
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+ ea^ev^x) + eeex(x) + eee^x) + eeeeD(x), (7.36) 

where 69 = 0 a 8 a etc.. The bosonic component fields, C, M, N, V/, D, have in total 
(o) + ( 2 ) + (4) = 8 complex degrees of freedom, while the fermionic component 
fields, x> X'> A, 0, have (f) + (3) = 8 degrees of freedom. Thus we have checked the 
coincidence of bosonic and fermionic degrees of freedom, as expected from SUSY. 
Altogether we have 8 + 8 = 16 degrees of freedom. This may be easily under¬ 
stood, since each term of the Taylor expansion can be written, in general, in a form 
Cijki(x) (tfimmm)'. We have 2 4 = 16 choices of the coefficient functions 
Cijki(x), as take either 0 or 1. 

The deviation of the superfield i5<f>(x ) = — <fi{x), under an infinitesimal 

transformation, cp'(x) = exp(i(eQ + eQ)) with infinitesimal Grassmannian 

parameters e and e can be easily calculated by use of the differential operators 
(7.33) and (7.34) as 

r\ _ r\ 

<50 = i{eQ + eQ)<j> = {e“(^ - d^) - e“(-^ + *6»“(cr M ) a( i5 AI )}0(a:). 

dO a Qf 

(7.37) 

Writing 5(j>{x) = 5C(x) + 06x(x) + • • •, the infinitesimal deviation of some of com¬ 
ponent fields can be read off as 


<5(7 

= <^X + e X. 

(7.38) 

<5 Xa 

= 2 e a M+ {o»)aaf & {V»+id fJ ,C), 

(7.39) 

<5 

= 2 + e“K)a “(Vm - ifyC), 

(7.40) 

SD 

= !K) aA (e Q d M r - {d^rrn- 

(7.41) 


We learn from the above exercise the following things, which play quite impor¬ 
tant roles when we attempt to construct a supersymmetric theory, i.e. a theory 
invariant under the SUSY transformation. 

(a) The dimensionality of the component fields 

As is suggested by (7.31), the bi-linear form of the Grassmannian coordinates 
8 a and 6 (l behaves as the ordinary space-time coordinate x fl . Hence, the 8 a and 
8 a should have mass dimension d = —1/2. The generators of SUSY transformation 
Q a and Q & therefore have mass dimension d = 1/2. This means by the SUSY 
transformation a component field with mass dimension d is transformed into a field 
with mass dimension d + \ or the space-time derivative of a field of lower mass 
dimension d - |. We also learn that the component field accompanied by a higher 
power of 8 a and da has higher mass dimension. 

(b) The property of D-term 

The above argument suggests that the “highest component” D(x), or “D-term”, 
proportional to the highest power 6688 , has the highest mass dimension. For in- 


Pidu* /clclL 'iA 



Superfield Formulation 


159 


stance, if we assign the Lorentz vector field V )L the mass dimension d = 1, the D field 
has d — 2. As the quadratic term of D(x) already has d = 4, we cannot include any 
space-time derivative in the free Lagrangian of the D field, thus making the D an 
auxiliary field, i.e. a field without dynamical degree of freedom and is describable 
in terms of other physical fields, once the equation of motion for the field is used. 

As the D field has the highest mass dimension, the infinitesimal deviation of 
the D(x) under the SUSY transformation, is inevitably the space-time deriva¬ 
tive of the field of lower mass dimension. In fact 6 D in (7.41) can be rewritten 
as a total derivative, noting that e and e are space-time independent constant: 
SD = d ll {%e a (cr' 1 ) a a\ a - ^“(o^ )<*<*£“}• This means that if the Lagrangian of a 
theory can be regarded as the D-trem of some superfield, the action is supersymmet¬ 
ric, since a total derivative term of the Lagrangian is irrelevant. (If space-time has 
some compact dimension, as usually assumed in higher dimensional theories, the 
derivative term may not be ignored for non-trivial boundary conditions of fields.) It 
is suggestive to note that a similar thing happens when a theory is invariant under 
a space-time translation, i.e. also in this case the Lagrangian itself is not invariant 
under the translation, but the deviation can be written as a total derivative, thus 
making the action invariant. 

(c) The property of closure 

To find out the multiplication rule of SUSY multiplets, namely a rule to re¬ 
construct a new SUSY multiplet out of the product of two SUSY multiplets, is 
a non-trivial task. In the language of the superfield, however, the multiplication 
rule is quite simple, i.e. we can just multiply two superfields. If 4>i and d> 2 are two 
superfields, their product <f> 3 (x,9,9) = cj>i(x,9,9)-(j> 2 {x,9,9) can be Taylor expanded 
again, and therefore behaves as a new superfield (“closure” property of superfields 
under the multiplication). Each component field of <p 3 is written as a bi-linear term 
of the component fields of <j>\ and <f> 2 , which provides us the multiplication rule. 
This is essentially due to the fact that the SUSY generator is a linear differential 
operator and the chain rule i{eQ + IQ)4> 3 = {i(eQ + eQ)<pi } -^ 2 + <j>\ • {i(eQ + eQ)<f> 2 } 
holds. 

By use of these properties, we can easily construct (some part of) the action of 
a supersymmetric theory, 


S = J d 4 i (7.42) 

where </> 2 > • ■ -)|£> denotes the D-term (the highest component) of the polyno¬ 
mial function / of superfields, <j>i, 4> 2 , • • •, which is again a superfield. The infinites¬ 
imal deviation of the D-term is a total derivative and the action is SUSY invariant. 
Since SUSY transformation is a sort of translation in the superspace, it is naturally 
expected that the integral of the Lagrangian over the entire superspace yields a 
SUSY invariant action. In fact, we find that the action can be equivalently written 
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as 

S = J d*x d 2 9 d 2 e (7.43) 

Let us note that the integration over Grassmannian variables is equivalent to the 
differentiation in terms of these variables. Thus d 2 6 d 2 6 is equivalent to extracting 
the D-term. 


7.5 Chiral superfield 

The general superfield turns out to be reducible representation of SUSY, as it con¬ 
tains fields with spin s = 0,1/2,1 together, while SUSY transformation changes the 
spin only by 1/2. It is thus important to identify the irreducible representations of 
SUSY. There are two kinds of irreducible representation, i.e. chiral superfield and 
vector superfield, which contain, as their component fields the chiral multiplet and 
the vector multiplet, respectively. 

In this section, we will discuss the chiral superfield, which contain component 
fields with s = 0 and 1/2. Naive guess is that we can get it by adopting a “chiral” 
superspace, whose Grassmannian coordinate is restricted to a Weyl fermion with 
definite chirality, 9 or 8, but not the both, e.g. space with coordinates (x^,^). 
Then the power series expansion with respect to 6 a will terminate at the quadratic 
term 89, and the appearance of the s = 1 vector field V/ will be evaded. Thus 
we are tempted to impose a constraint in order to define, e.g., a “right-handed” 
chiral superfield 0, 0 = 0, so that 0 does not depend on 8. This constraint 

equation, however, does not work, since the condition is not compatible with SUSY, 
i.e. {Q a , gfr} 7^ 0. We then have to find out a suitable differential operator, which 
(anti-)commutes with the SUSY generators. The desirable operators turn out to be 


r\ 

Da = + i((T^) a a 8 a <9 m , 

(7.44) 

r\ 

D« = ~—-i9 a M ai 
dT 

(7.45) 


It is straightforward to check that these operators anti-commute with SUSY gener¬ 
ator, 

{Q a ,D 0 } = {Q & ,D a } = {Q a ,D & } = {Q & ,Dfi} = 0. (7.46) 

Let us note that these “SUSY-covariant” derivatives D a and are quite similar 
to iQ a and iQa in (7.33) and (7.34), except the sign in front of the space-time 
derivative. This is not an accident. Let us recall that the differential operator Q 
and Q were read off from the relation, G(0, e,e) ■ G(x^,9,8) = G(x M + iScr^e — 
iea^8,9 + e,9 + e), where we identified the “left multiplication” of G(0,e,e) as 
an action of the element of SUSY group. What happens if the multiplication of 
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G(0, e, e) is from the right? The only change will be that the sign of the space-time 
translation, proportional to e or e, is just opposite to the case of left-multiplication, 
i.e. —ida^e + im tL 9. Thus we find that the operators D and D can be regarded 
as the differential operators corresponding to the right-multiplication. It is now 
trivial that they commute with the SUSY generators, just because the right- and 
left-multiplications are mutually independent. 

Now, for instance, a right-handed chiral superfield 0 can be defined as 

Da </> = 0. (7.47) 

(The left-handed superfield is defined similarly by replacing Da by D a .) As ex¬ 
pected, Da@ a = 0 means that the superfield 0 may depend on the right-handed 
coordinate 9 a . The space-time coordinate however, is not a suitable coordinate 
to describe 0, as = —i 9 a (a fl ) a a does not vanish. We note that the bi-linear 
form of 9 and 9 behaving as a Lorentz vector 6a^9 satisfies Da(9cr^9) = 9°‘(a ,1 )aa- 
We thus find the suitable Lorentz vector to describe 0 is y^ = + i (9a^9): 

= 0. In this way, the chiral superfield should be a function of coordinates 
(y'\ 6a), and can be expanded in a power series of 6 as 

0(y, 9) = A(y) + a/2 9ip(y) + 99F{y) 

= e i{6a ^ 9 » (. A(x ) -I- V2 9ip(x) + 99F(x )) 

= A(x) + i {9^9) d^Aix) - i 9999 □ A(x) + Vl, 9rt>(x) 

- —=ee {d^a^ + eeFix), (7.48) 

V2 

where □ = d^d fi and we have used the property that the power series expansion 
of e l ( 9tr>le ) d e ends up at {(9o tl 9)d tl } 2 , because of the anti-commuting property of 6 
and 9. In the derivation, some relations listed in Problem 7.1 may be useful. The 
component field F has appeared in addition to the expected scalar and fermion 
fields A and ip. The field F has mass dimension d = 2, and is anticipated to be an 
auxiliary field, just as the D field in a general superfield. Let us note that in the 
third line of the right hand side of (7.48), derivative terms have appeared though 
there is no derivative of the chiral superfield at the beginning. This is due to the 
“non-locality” of the coordinate which has a deviation i(9a fl 9) from and will 
play an important role in the construction of the kinetic term for the component 
fields. 

In terms of the coordinates ?/ M , 9 a , the infinitesimal SUSY transformation reads 
as 

i{(eQ + eQ)} cp(y, 9) = ( e “^ + 2*0 a (o- M ) O£i e“<9 M ) 0(y, 9), (7.49) 

where = d/dy^ 1 . It is now easy to derive the transformation property of each 
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component field: 

SA = 

a/2 op, 

(7.50) 


dtp = 

\/2i(j ,1 ed )1 A + a/ 2 eF, 

(7.51) 


SF = 

— a/2 iar^dfjip. 

(7.52) 


We realize that again the deviation of the “highest” component field F can be 
written as a total derivative (as e is ^-independent), which becomes important in 
the construction of supersymmetric theory of chiral superfields. 

In order to see the multiplication rule for chiral superfields, it is suggestive 
to recall that multiplication of holomorphic functions f(z) and g(z) gives another 
holomorphic function h(z) = f(z)g(z), while the multiplication of holomorphic and 
anti-holomorphic functions is no longer holomorphic nor anti-holomorphic, but is a 
general complex function. A complete analogy holds in the multiplication rule for 
chiral and anti-chiral superfields, i.e., 

chiral x chiral = chiral, 

anti-chiral x anti-chiral = anti-chiral, (7.53) 

chiral x anti-chiral = general. 

Thanks to the multiplication rule, especially the closure property among chiral 
or anti-chiral superfields, it is quite easy to form a SUSY invariant monomials of 
such superfields, 

J d i y{^>i{y, 0 )(l> 2 {y, 0 )---(j) n (y,e)}\ F ~ J d 4 yd 2 6{4>i<p 2 ■ ■ ■ <t>n}, (7.54) 

where F denotes the extraction of the F-term of the chiral superfield obtained by the 
multiplication of the chiral superfields <j)i(y,9) etc., which is a total derivative with 
respect to This procedure will be used in the formation of the “superpotential” 
which is discussed below. In (7.54), we may finally replace by the ordinary 
space-time coordinate x fl . 


7.6 Wess—Zumino Model 

So called (j) 4 theory is often discussed as a prototype theory of quantum field theory 
for the purpose of getting some essential ingredients of the quantum field theory. In 
this section, we discuss the supersymmetric extension of the </> 4 theory, a supersym¬ 
metric model of chiral multiplet (A,, -ipi) (i = 1,2, • • •), called Wess-Zumino model. 
As the new feature of the Wess-Zumino model, we have not only self-interaction of 
scalar fields but also Yukawa-type couplings between scalars and fermions. As the 
consequence of the SUSY, the coupling constants of such interactions are identical, 
and there should be mass degeneracy between the scalars and the fermions, unless 
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there is SUSY breaking. 


• The kinetic term 

To get kinetic terms for scalars and fermions, it is clear that we need quadratic 
terms of chiral superfields. We learn from ordinary field theories that kinetic terms 
are those which are hermitian by themselves. We thus expect the multiplication of a 
chiral superfield and its hermitian conjugation, i.e. a anti-chiral superfield, provides 
the desirable kinetic term. This turns out to be the case, since the difference between 
y' 1 and yields the non-locality, and therefore the kinetic term. (As already has 
been pointed out and we will see below more concretely, a simple square of a chiral 
superfield just provides a quadratic term in “superpotential”, but not a kinetic 
term.) In fact, the multiplication yields for a given chiral superfield <p 


</’(y,0) t 0(y>0) 


= (e </>(x,0 ))t (eW'* 9 )** 4>{x,6)) 

= A(x)*A(x) -I- y/26tp(x)A(x)* + y/2(hp(x)A(x) + • • • 

+ 09§S[-^A*D A - i(DA*)A + i(0 M A*)(dM) 

+ + F*F]. (7.55) 


According to the multiplication rule, this product behaves as a general superfield. 
Thus taking the D-term we readily obtain the desirable SUSY invariant kinetic term 
for the set of chiral superfields: 


Ufcin — D 

= (df,A*)(d M Ai) - + F*Fi + (total derivative). (7.56) 


As we expected from dimensional counting, the field F has no derivative in this 
“kinetic” term and therefore is an auxiliary field. Namely, F does not propagate by 
itself, unless it couples with propagating fields through interactions. The remaining 
physical fields, A and have just ordinary forms of kinetic terms. 


• The superpotential 

The aforementioned kinetic term is a free Lagrangian and does not contain any 
interactions. Let us now consider the self interaction of chiral superfields. Just 
as the self-interaction of scalar fields A* are described by scalar potential V (A), 
supersymmetric “self-” interaction is generally describable in terms of a polynomial 
of chiral superfield W {(p ), called superpotential: 

kF(</>) = T 2 m i,j4 > i < Pj T ~Q9i,j,k < Pi ( l > j ( Pk ■ (7.57) 

^From the closure property under the multiplication, the superpotential itself should 
be a chiral superfield. Thus f d 4 x W{4 >)\f (after the replacement of j/ M —> aA) should 
be SUSY invariant. Let the mass dimension of W ( <f >) be dw ■ Then its F-term should 
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have a mass dimension d + 1, as 69 carries d = 1. This is why the superpotential 
ends up at cubic terms of chiral superfields. Namely if we introduce terms higher 
than cubic, such as quartic terms, the mass dimension of the F-term should exceed 
4, thus leading to a non-renormalizable theory. We will see below, more concretely, 
that such cubic superpotential eventually yields quartic self-interaction of scalar 
fields and Yukawa-type couplings. 

Let us now calculate the F-term of the superpotential W. There is a sys¬ 
tematic way to derive the F-term. We first calculate the F-term of a monomial 
(4>i<p2 ■■•<j>n)\F- The F component is the sum of the terms, which are quadratic 
in the Grassmannian coordinates 9. To get the quadratic terms there are only two 
possibilities. One possibility is to pick up Fi component from some chiral superfield 
(pi and take scalar components from all remaining chiral superfields: 

A\ ■ ■ ■ A n -\F n + Ai ■ ■ ■ A n ^2F n -iA n + ■ ■ ■ = —-— 1 ^- — Fi. (7.58) 


The right hand side means that such term can be compactly expressed in terms of 
the first derivative of the monomial, obtained by replacing all chiral superfields by 
their scalar components. The next possibility is to take fermions fron two different 
places of the monomial and take scalar fields from all remaining chiral superfields: 


A\ ' ' ' VVi— l^na Ai ■ ■ • V’n—2 A n —i1pna 


1 d 2 (A 1 A 2 ---A n ) 

2 dAidAj 


Ipi Tpj- 


(7.59) 


As this compact way of writing is easily known to be valid for arbitrary monomials 
and because differential operators are linear operators, this procedure is applicable 
for an arbitrary superpotential. Hence, the F-term of the superpotential can be 
written as 


TO) |i 


dW(A) 

dAi 


r, 1 d 2 W(A) , ( 


(7.60) 


The whole Lagrangian now reads 


C. = C kin -{W{cj,)\ F + h.c.} 

= (diiA*)(d^Ai) - hP^d^i + F*Fi 


- { 


dW(A) 

dAi 


1 d 2 W(A) 

2 dAidAj 


tpiipj + h.c.}. 


(7.61) 


The auxiliary field does not possess an independent dynamical freedom by itself, 
and is expected to be eliminated from the Lagrangian, if we wish. In fact, by use 
of Euler-Lagrange equation for Fi, F\ can be expressed in terms of scalar fields, 
as there is no derivative term of the auxiliary field in the Lagrangian. For such 
purpose we first write down the terms containing Fi fields in C, 


F? Fi - ( 


dW{A) 

dAi 


Fi + h.c.) = (F* - 


dW{A) s ,„ r <9W(A) ,dW(A) 
~MT ){Fi ~ { ~dA~ ) )-I ~3aT 


(7.62) 
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By use of the Euler-Lagrange equation, the quantities inside the bracket just Dis¬ 
appear: 


dW{A) 
i ~ dAi 


Hence, the (ordinary) scalar potential V(A) is given as 


V{A) = | 


dW(A) 

dAi 


(7.64) 


In this way the Lagrangian of the Wess-Zumino model can be rewritten in terms 
of only physical fields as 



(d.A^A^-i^d^i 
1 ,d 2 W{A) , , , . 

> dW{A) 2 
1 5A, 1 ’ 


(7.65) 


The first line at the right hand side is ordinary kinetic term and the second and the 
third lines denote Yukawa coupling and scalar potential, respectively. 


• Solving the hierarchy problem 

To see how SUSY works as a mechanism to solve the hierarchy problem or 
the problem of quadratic divergence, we will consider the Wess-Zumino model of a 
single chiral multiplet, with superpotential W (</>) = <j> 2 — \ g<t > 3 • The Lagrangian 

is explicitly written as 

C = (d li A*)(d fi A) - hpa^dfj.ip 

+ i(\ m ~ 9^) i/> 2 + h.c] 

- \mA — gA 2 \ 2 . (7.66) 

Thus we know that the complex scalar A has a mass m and the fermion ip has 
the same Majorana mass m. Thus the degeneracy of the masses of boson and 
fermion is realized, as an important consequence of SUSY. We further note that the 
Yukawa coupling Aip 2 and the scalar self-interaction (A* A) 2 have identical coupling 
constant g. This is another important consequence of SUSY. One remark here is 
that the scalar self-coupling is proportional to g 2 , not g as in the Yukawa coupling, 
and SUSY seems to be not manifest. One may convince oneself that SUSY is really 
maintained, by noting that the self-interaction is originally due to the coupling 
between scalar A and auxiliary field F, 2 gFA 2 + h.c., whose coupling constant is 
proportional to g, thus preserving SUSY (see Fig. 7.2). 

As was mentioned in the beginning of this Chapter, SUSY has a remarkable 
feature to stabilize the scalar mass under the quantum correction, i.e. the problem of 
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Fig. 7.2 


quadratic divergence is absent in SUSY theories. Let us now explicitly show that the 
cancellation of the quadratic divergence does happen in the Wess-Zumino model. 
What we calculate is the Feynman diagrams (a) and (b) shown in Fig. 7.2. In order 
to calculate the diagram with ^-exchange, we need a propagator for the fermion ip. 
Since we are familiar with the propagator for 4-componet spinor, and as ip has a 
Majorana mass term \mip 2 + h.c ., let us define a Majorana fermion ipM = ip + (ip ) c , 
with ip being understood as a 4-vector (ip a , 0)*, whose free Lagrangian is given by 
\ ip — rri) ipM ■ In terms of ipM , the relevant Yukawa coupling can be written 

as —gAip 2 + h.c. = gAipM l+ 2 ^ ipM + h.c. According to the discussion in Chapter 6, 
the propagator of Majorana particle ipM is just the same as the case of ordinary 
Dirac fermion, except that not only ipMipM but also iPm^Pm type propagator exist, 
due to the fact ( iPm) g = ipM (see (6.40) and (6.41)). Paying attention to this 
specific feature of the Majorana particle, we may calculate Fig. 7.2(b) according to 
the ordinary Feynman rule. We can directly check that there is a cancellation of the 
quadratic divergence between the diagrams (a) and (b). Actually, we find that for 
to = 0 the sum of these two diagrams identically vanish. We may also check that 
when SUSY is broken (spontaneously or explicitly) and only A gets a mass Msusy, 
denoting the extent of the SUSY breaking, there remains a logarithmic divergence 
in the sum of these diagrams, i.e. (for m = 0) oc Mg USY log A. This is why Msusy 
should not exceed 1 (TeV) or so, since otherwise the hierarchy problem will arise 
again. Let us note that another type of diagram coming from the contraction of 
two identical vertices, e.g. of the type of AipM^^ipM, does not yield the quadratic 
divergence, even for m ^ 0, and therefore has been ignored in the discussion above. 
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7.7 Vector superfield 

Though the general superfield, discussed in section 7.4, is a complex field in general, 
we may get an irreducible representation V of SUSY by imposing a hermiticity 
condition 


V f = V, (7.67) 

since the infinitesimal transformation of SUSY is described by an operator, i(8Q + 
8Q), which may be regarded to be hermitian as Q and Q are differential operators. 
Then the component fields appearing as a coefficient of 8a tl 9 is a real vector field 
V M , which may be identified as a gauge field in supersymmetric gauge theories. Here 
we will retain in Abelian gauge theory. The generalization to non-Abelian case is 
rather straightforward, once we realize what are the new features of SUSY gauge 
theories from the simplified Abelian case, since SUSY and gauge symmetries are 
mutually independent symmetries. The “real” superfield may be written as follows 

V = C+ i8 x -idx+yO(M+ iN)- l -W{M-iN) 

z z 

- + i999 (A + ~a»d^x) ~ iM6 {A + -a^d^x) 

+ \6066{D - \uC), (7.68) 

z z 

where bosonic component fields C, M, N, and D are all real. In the above equa¬ 
tion the lower component fields C, x, X have appeared also at the positions of higher 
component fields with derivatives. To see the reason, let us consider a supersym¬ 
metric extension of (Abelian) gauge transformation (see (7.93)) 

V 1 = V + *(A- A+), (7.69) 

with the transformation parameter A being a chiral superfield. Expand the A in 
terms of component fields as A = A(y) + y/26\jj{y) + 69F(y). Then from the “non¬ 
locality” of the coordinate j/ M , 

i( A-At) = i(A - A*) + iy/2(9ip - Ftp) + i69F - i98F* 

- g^Od^A + A*) - -^={999^8^ - 999^8,^) 

v2 

- l - 9999U{A-A*). (7.70) 

It is now easy to read off the SUSY gauge transformation of each component field 

C'->C" = C + i(A-A*), (7.71) 

X -A X' = X + V2</>, (7.72) 
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M + iN -i M' + iN' = M + iN + 2 F, (7.73) 

V^V^V^ + d^A + A*), (7.74) 

A -> A' = A, (7.75) 

D -> D' = D. (7.76) 


We realize that, because of the specific way of writing in (7.68), the gauge fermion 
A and the auxiliary field D are gauge invariant: (7.75) and (7.76). This property 
is desirable, since the gauge fermion, e.g., belongs adjoint representation, Which 
is gauge invariant in the Abelian gauge theory. On the other hand, by use of the 
transformation property (7.71) - (7.73) , one immediately know that by suitable 
choice of the parameters ImA, ip, F, the lower component fields C, x, M, N are 
readily gauged away. Thus, without loss of generality, we may assume the following 
form of the vector superfield, called Wess-Zumino gauge: 

V = -da^OV^ + i(606 A - WdX) + ^0666D . (7.77) 

In some sense, the above situation is similar to what we expect in ordinary gauge 
theories, i.e. the longitudinal component of gauge field, which can be written as a 
derivative of a scalar function may be gauged away. However, we should also note 
that even after moving to the Wess-Zumino gauge, we still have a degree of gauge 
transformation due to the real part of A{x), 

U M -> V; = V M + 2<9 M (ReA), (7.78) 

which corresponds to the ordinary gauge transformation with transformation pa¬ 
rameter Re A. 

In this way, we have identified a SUSY multiplet containing vector field U M , 
called vector multiplet, 

(M*), £(*))• (7-79) 

The mass dimensions of these component fields are readily known to be (1, 3/2, 2). 
Thus A should be a physical field called gauge fermion or gaugino, the SUSY partner 
of gauge boson V M , while D is an auxiliary field, which should be written in terms 
of physical scalar fields once Euler-Lagrange equation for D field is used. 

• Field strength superfield 

In order to get the kinetic term for the vector multiplet, we need to construct 
some gauge invariant (covariant in non-Abelian case) superfield, whose product 
provides the desirable kinetic term, out of gauge variant superfield V. 

We have learned from the aforementioned gauge transformation property that 
the gauge fermion A is gauge invariant, while V M transforms inhomogeneously. Thus 
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a reasonable way to get the gauge invariant superfield is to differentiate V, so that 
it can extract the \ a field as the lowest component: 


3 3 3 

36 36 36 a 


V. 


(7.80) 


This is not a correct way of doing, since the differential operators etc. do not 
commute with SUSY algebra, and some modification is needed: 


W a = -7 DDD a V. 
4 


(7.81) 


Such defined W a and W a are called field strength superfield. 

The field strength superfields turn out to be chiral superfields. In fact, we can 
easily show, for instance, 

Dp W a = - l -DpDDD a V = 0, (7.82) 


where we have used the property D^DD = 0 as the result of {Da, Dp) = 0. Another 
important feature of W a is that it is gauge invariant: 

W a -+W a = Wa- t -DDDa(A-A') = Wa- t -D{D,D a } A 

= W a - 1 - (D, Da}DA = W a , (7.83) 

where we have invoked to the chiral nature of A and A, D&A = D a A* = 0. 

Let us power expand W a over 6, in such a way where component fields are 
functions of y For such purpose we first rewrite V in terms of y 11 = +i{6cr fl 6): 


V = -6o»6V^y) + i{666A{y) ~ 666\{y)) 

+ ^6666{D{y)+id fl V> i {y)). (7.84) 

We also rewrite “SUSY-covariant” derivatives (7.44), (7.45) in terms of y^ and 

6 a ,6 & : 


D a 

Da 


r\ 

— + 2i{a> i )aaO a d ll , 

3 

36°' 


(7.85) 

(7.86) 


Then, according to the definition (7.81), W a is obtained after some calculus to be 


W a 


= -iA a (y) + [<5 a *D{y) - l -{^a v ) a *(0^(11) ~ d v V^y))]6p 
+ ee{p*) a& d^iy). (7.87) 


As we expected the lowest component is the gauge fermion A and all of A, D and 
the field strength = d^Vv — 3 v V^ are gauge invariant. 
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• Kinetic term for the gauge field 

From the above expression for W a , it is clear that W a W a \p provides the kinetic 
term —iXa^d^A, etc. for the members of gauge multiplet: 

Cgauge-kin = ^ (W a W a \ F + h.C.) 

= - i\ a {a^) a ^ A* + l -D\ (7.88) 

The kinetic term for gauge boson V ^ is of ordinary type. The kinetic term for the 
gauge fermion A may be understood as that for a Weyl fermion. Since the gauge 
fields should be “real”, it may be natural to expect that gauge fermion is described 
by a Majorana fermion Am = (A a , A a )L In fact, the kinetic term can be re-written 
as |Am*^Am, up to a total derivative. (A Weyl and a Majorana fermions are iden¬ 
tical, as far as the fermions are massless.) As we expected the “kinetic term” for the 
D field does not have any derivative and D should be understood as an auxiliary 
field. 

• Gauge invariant Lagrangian 

Let us consider a SUSY £7(1) gauge theory, whose typical example is SUSY 
QED. Generalization to the non-Abelian case is rather straightforward. 

Suppose we have chiral superfields fc, which have electric charges Qi. Under a 
global ?7(1) gauge transformation the chiral superfield transform as 

= e- i2eQiA <j>i, (7.89) 

where gauge transformation parameter A should be regarded as a chiral superfield, 
not to spoil the chirality of (f> by the transformation. Just as in ordinary field theory, 
the self interaction of chiral superfields, i.e. superpotential should be invariant under 
the transformation (7.89): 

W(4>i) -t W{(t>' i ) = W{<t> i ). (7.90) 

When the U{1) transformation is made local, allowing A to depend on the 
superpotential clearly remains invariant. The kinetic term for 4>i, however, is no 
longer invariant, 

-)• e~ 2ieQi{ - A ~ A ^ 4>\<S)i. (7.91) 

In order to make it gauge invariant we now introduce the vector superfield V to com¬ 
pensate the factor e - 2ie< 3<( A - At ) ) i. e . the kinetic term is now replaced by “covariant 
derivative” term, 

<t>\e 2eQiV <j>i, (7.92) 

where V should transform under the local gauge transformation inhomogeneously, 

V -t V - '= V + i(A-A t ). (7.93) 
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Actually the Taylor expansion of e 2eQiV ends at V 2 : V 2 = -OdOOV^V^, V n = 0 
for n > 3. By use of this fact we can check that the SUSY invariant (up to a total 
derivative) D-term of (7.92) really contains the ordinary covariant derivatives for 
scalars and fermions, though it also contain interaction terms of A, A ,D, which are 
characteristic to the SUSY gauge theory: 

<t>\e 2eQiV = (D IJ .A i )*(D fl A i ) - i^D^i + \F\ 2 

- V2ieQi(Ai\ipi - A*\i/ji) + eQiD\At\ 2 , (7.94) 

where D M denotes the ordinary gauge-covariant derivatives 

D^Ai - (<9 m + ieQiVfAjAi, = (<9 M + ieQiV^ipi- (7.95) 

The terms in the second line of (7.94) are the new types of gauge interactions due 
to the presence of gaugino A, A and auxiliary field D, whose coupling constants are 
identical, i.e. e, because of the SUSY. 


7.8 SUSY QED 

As the immediate application of the SUSY U( 1) gauge theory discussed above, let 
us consider the supersymmetric extention of QED, namely SUSY QED. 

The matter field, electron, should be identified with the fermionic components 
of two (right-handed) chiral superfields, 

= (e*,e*,F*), <t>+ = (el,e+,F* L ), (7.96) 

where ej t and = (e£) c are the right-handed components of electron and positron 
in 2-component notation, and by we denote the SUSY partners of L , “se- 
lectron”. The vector superfield V = (V M , A, D) contains the photon field and its 
superpartner, “photino”, A. 

Since the electron has a bare mass term —mee in ordinary QED, we employ the 
following superpotential 

W = m<t>-cj> + . (7.97) 

Combining with the SUSY invariant and gauge invariant kinetic terms for vector 
and chiral superfields, we get the full Lagrangian for the SUSY QED 

£ = ^(W a W a \ F + h.c.)+4>le~ 2eV 4>-\ D + cj)le 2eV (t> + \ D 

— (m<j>^<f> + \F + h.c.). (7.98) 

Writing explicitly in terms of the component fields, the Lagrangian reads as 

L = --^F^-iXa^d^X+^D 2 
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Fig. 7.3 


+ {I^e R )'{D»e R )^{D»eiy{D»el) 

— ie R a^ D^e R — ie^a^ + \F r \ + |-Fl| 

+ V2ie(e R \e R - h.c.) - V2ie(e* L Xe J - h.c.) - eD{\e R \ 2 - jez,| 2 ) 

+ m(e R e R +h.c.) ~ m(e R Fl + e* L F R + h.c.). (7.99) 

So far spinors for electron and positron have been all written down by use of the 
2-component notation. When we calculate Feynman diagrams, however, it will be 
more convenient to utilize the familiar 4-component notation, so that we can use 
the ordinary form for the spinor propagators. Thus we define 4-component spinors, 
Dirac spinor e denoting the electron, and a Majorana spinor Am for the photino: 


e = 


Am = 



(7.100) 

(7.101) 


In terms of these 4-component spinors, the Lagrangian is given as 

C = A f ^f^+ 1 -^Xm + \d 2 

+ |D M e#| 2 -I- |D M ez| 2 -I- ielfe + |Fr| 2 + |Fl| 2 

+ v / 2ie{(e^A M 1+ 2 75 e + e* L X M 1 e) - h.c.} - eD(|e fl | 2 - |ez| 2 ) 

- mee-m{e* R F L +elF R + h.c.). (7.102) 

As we clearly see in the Feynman rule shown in Fig. 7.3 the coupling constants of 
the photon, photino and the auxiliary fields are the same, i.e. e. 

After eliminating the auxiliary fields F Ri l and D, the scalar potential gets con¬ 
tributions from both of superpotential and gauge interaction. To see this explicitly, 
let us write down the terms including the auxiliary fields in the Lagrangian 

\F r \ 2 + |Fl| 2 - m(e R F* L + e* L F R + h.c.) + l -D 2 - eD(|e fl | 2 - |e L | 2 ). (7.103) 
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By use of equations of motion for the auxiliary fields 

F r = met, F l = me R , 

D = e (\e R \ 2 -\e L \ 2 ), (7.104) 

we get the scalar potential 

V = \F R \ 2 + \F L \ 2 + ^D 2 

= rn 2 (\e R \ 2 + |e L | 2 ) + ^e 2 (|e fl | 2 - |e L | 2 ) 2 . (7.105) 

We easily find that the vacuum state of this theory is achieved by vacuum expecta¬ 
tion values ( e R ) — (et) = 0. The corresponding vacuum energy E v = (0|£f|0) also 
vanishes, and it indicates that SUSY is not spontaneously broken, as we will see in 
the following section. The term m 2 (|efl| 2 + |et| 2 ) in the potential V thus provides 
scalar mass-squared m 2 . This means the masses of electron and selectron are the 
same as the inevitable consequence of SUSY. 


7.9 SUSY Yang-Mills theories 

The argument for SUSY 17(1) gauge theory is easily generalized to non-Abelian 
gauge theory including the SUSY extension of the standard model, such as MSSM. 

The transformation of a set of chiral superfields, denoted by a column vector <j>, 
belonging to a representation of non-Abelian Yang-Mills gauge group, takes a form 

<j> -> 0' = e~ i29A 4>, (7.106) 

where A is a matrix belonging to the Lie algebra 

A = T a A a , (7.107) 

where A a are transformation parameters and the generators T° satisfy (in a suitable 
normalization) 

[T a ,T b ] = if abc T c , Tr ( T a T b ) = ^ 6 ab . (7.108) 

Correspondingly the vector superfields V a belonging to the adjoint representation 
of the group behave as 

e 2 9 V e 2 gV' = e -i2g\' e 2gV e i2gh (7.109) 

where 

V - T a V a , V' = T a V' a , (7.110) 

so that the term 0te 2 ® v <f> becomes gauge invariant. 
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A gauge covariant field strength superfield is obtained by a straightforward ex¬ 
tension of the Abelian case (7.81), 

W a = - i DDe~ 29V D a e 2gV . (7.111) 

It is easy to show that such defined field strength has a desirable transformation 
rule, 


W a ->W' a = e~ i2gK W a e i2gK . (7.112) 

Thus gauge invariant and SUSY invariant Yang-Mills theory with a gauge in¬ 
variant superpotential W (</>) is described by 

C = ~Tv(W Q W a \ F + h.c.) + <t>'e 29V <l>\ D 

- ( W{cj>)\ F + h.c .). (7.113) 

The generalization to the case with several chiral superfields is quite straightforward. 
Let us note that we still can impose a condition of Wess-Zumino gauge for each 
of V a . The Feynman rule shown in Fig. 7.3 for SUSY QCD is now modified in a 
trivial manner by a prescription, 


-ie -*• igT a , (7.114) 

for the gauge interaction vertices of (V“,\%f, D a ), with (e, e ) being replaced by the 
scalar and spinor components of <f>. 


7.10 Minimal Supersymmetric Standard Model (MSSM) 

The SUSY Yang-Mills theory of the most physical interest is the SUSY extension 
of the standard model. Its minimal version, with minimal number of Higgs chiral 
multiplets, is called Minimal Supersymmetric Standard Model (MSSM). In this sec¬ 
tion, we discuss MSSM very briefly, focusing on the basic structure of the model. 
For more detailed discussion on the MSSM and its various phenomenological im¬ 
plications, we ask readers to refer to the nice reviews in the literature (Haber and 
Kane, 1985; Weinberg, 2000). 

Basically the construction of the MSSM is achieved just following the argument 
of the previous section. In particular the Lagrangian is given by (7.113), though the 
second term, the kinetic term (with covariant derivative) of chiral field (j), should 
be replaced by the sum of the kinetic term of each existing chiral field with distinct 
gauge generator V = T a V a corresponding to its reperesentation of the gauge group 
SU( 3) c x SU (2 )i x U (l)y. The superpotential W of the chiral superfields may 
take, in principle, the most general form compatible with gauge symmetry, as in 
the standard model. We will argue, however, that some restriction is needed, since 
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otherwise some serious problems, such as too rapid proton decay is induced. The 
restriction is due to a global symmetry called H-symmetry, as we will see below. 

The gauge group of MSSM SU( 3) c x SU(2)l x U (l)y is just the same As that of 
the standard model, and the gauge bosons and gauge fermions form vector mmmul- 
tiplets, whose interactions are uniquely determined by gayge principle, as is seen in 
(7.113). The matter fields, on the other hand, are of our choice. In the stabdard 
model, the quarks and leptons to start with are Weyl fermions. Thus it is natural 
to assign quarks and leptons as the members of chiral multiples. The remaining 
matter fields, Higgs, being complex fields, should belong to a chiral multiplet. 

Actually we realize that to construct a realistic theory, one SU(2)l doublet of 
Higgs chiral superfields is not enough, and we should add another doublet. The 
reason is two-fold. 

(1) In the (non-SUSY) standard model the Yukawa couplings or masses of down- 
type and up-type quarks are provided by <fi and 4> = i<T2<t>* , respectively. One may 
naively expect that the Yukawa couplings are just replaced by the corresponding 
superpotential of the chiral superfields of quarks and the Higgs doublet. Unfortu¬ 
nately, its does not work. This is because, is obtained by the operation of complex 
conjugation (charge conjugation), and therefore the chiralities of (/> and 4> are op¬ 
posite, while superpotential should be the polynomial of superfields with the same 
chirality. We thus need to introduce two independent Higgs doublets in order to 
provide masses to down-type and up-type quarks, say Hd and Hu, with the same 
representations of the gauge group as those of <j> and <f>, both being left-handed chiral 
superfields. 

(2) Suppose we introduce chiral superfields whose scalar components are the mem¬ 
bers of one Higgs doublet. As the new feature of the SUSY theory, chiral superfields 
also contain the fermionic superpartners of the Higgs scalars, i.e. Higgs fermions. As 
the Higgs fermions are Weyl fermions, they yield triangle gauge anomalies. For ex¬ 
ample “L/(l)y” anomaly, coming from the triangle diagram where all three vertices 
are made of the currents coupled with the U (l)y gauge boson, arises by the presence 
of the Higgs fermion. Ther also appear the (SU(2)l) 2 U (l)y type anomaly as well. 
Such anomalies will be vancelled by the introduction of another Higgs doublet, 
just because they have just opposite quantum numbers, with weak-hypercharges 
Y = ±1. For instance U (l)y anomaly will vanish as (+1) 3 -I- (—l) 3 = 0. 

Thus the matter fields of MSSM with 3 generations of quarks and leptons are 
all given as left-handed chiral superfields 

Qi = ( 2 ) > li ( * = 1,2,3) 

h ° = ( - ( i ) ’ (7 ' u5) 

where we have used the same letters to denote the chiral superfields as those of the 
ordinary particles in the standard model. For instance for generic quark q and Higgs 
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H chiral superfields, their spin s = 0,1/2 components together with the auxiliary 
fields F g , Fh are written as 

q=(q,q,F g ), H=(H,H,F h ), (7.116) 

where q and H are the superpartners of ordinary particles, i.e. a squark and a Higgs 
fermion. 

Next thing to do for the construction of the MSSM is to choose the form of the 
superpotential W. To provide the Yukawa couplings necessary to give quark and 
lepton masses and the self-interaction of Higgs doublets, we employ the following 
superpotential 

W = fijUiQjHy + ftjdiQjHD + flfkL.Ho 

+ (iHuHd, (7.117) 

where i — 1,2,3 is the generation index, and the abbreviation QjHu etc. is used 
for Q t j(i(J 2 )Hu etc., to denote the SU (2) invariant products of two doublets. 

Actually this superpotential is not the most general one compatible with gauge 
symmetry. Let us note that in the standard model, the Higgs potential is the most 
general one, so that it can prepare every counterterms for possible UV-divergences 
at the quantum level. It is important to note that in the supersymmetric standard 
model, the lepton doublet L and the Higgs doublet Ho have exactly the same 
quantum number and a priori there is no way to distinguish these two, though in 
the non-SUSY standard model, they were clearly discriminated by the difference 
of their spins. In fact, the above superpotential has not included, for instance, the 
following type of possible gauge invariant operators 

udd, dQL, IL 2 , (7.118) 

These terms clearly break either of baryon or lepton number, B or L, and will easily 
lead to too rapid B or L violating processes such as proton decay, or too frequent 
FCNC processes such as /i —> ey, as the coefficients of these operators are generally 
independent of the Yukawa couplings given in (7.117) and can be new sources of 
flavor mixings. Thus it is desirable to devise a mechanism to guarantee the absence 
of these undesirable operators. 

Generally, to ignore terms, or operators, which is compatible with gauge sym¬ 
metry is not allowed for the theory to be renormalizable, since such terms are 
gauge invariant and they are generally induced at the quantum level with UV- 
divergence and the counterterms are needed anyway. Thus the elimination of the 
terms listed up in (7.118) does not seem to be justified. (One may expect that the 
“non-renormalization theorem”, characteristic to the SUSY theories, may work as 
the justification. The SUSY, however, should be broken eventually for the theory 
to be realistic.) The elimination is, at least, technically allowed if the elimination 
is able to be regarded as the consequence of some global symmetry, independent of 
the local gauge symmetry. It is not hard to imagine that the terms incompatible 
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with the global symmetry, imposed on the whole Lagrangian, will never appear at 
any orders of quantum corrections. Let us note that while the tri-linear term in the 
superpotential dQHo yields ordinary Yukawa coupling of ordinary particles dQHo, 
undesirable term dQL will result in a tri-linear coupling with one superpartner and 
two ordinary particles, dQL. Thus in order to distinguish these two types of terms, 
it seems to be natural to devise a global symmetry, which assign different transfor¬ 
mation properties for ordinary particles and their superpartners. We thus consider 
72-symmetry, which is a characteristic symmetry in SUSY theories. 

What we consider is some global U(l) symmetry. It is trivial that to distinguish 
ordinary particles and their superpartners, the global symmetry should assign dif¬ 
ferent U( 1) charges to the different members of the same chiral multiplet. Hence 
a global symmetry which assign an overall phase for the all members of a chiral 
multiplet is not enough for our purpose. 72-symmetry is the symmetry under the 
chiral transformation of the Grassmannian coordinates of superspace, 

0 -> O' = e^O, 0 O' = e- iLf 6, (7.119) 


or 


o»¥>75 


Accordingly a generic chiral superfield </> transforms as 

<f> = A + V2Oxp + OOF -><£'= e icifi <f> = A' + y/20'tp 1 + 0'0'F', 


(7.120) 


(7.121) 


where c denotes the overall 17(1) charge of the chiral multiplet. The each component 
field therfore transforms as 


A' = e icifi , xj}' = F' = e i(c “ 2 )v F. (7.122) 

The F-term of the superpotential W\f ~ Jp- W should be invariant under the trans¬ 
formation of 72-symmetry. Thus the transformation property of the superpotential 
should be 


W -> W' = e 2 iv W. (7.123) 

For instance, for the superpotential W = <t>i<t>2<l>3 of some three chiral superfields 
to be i?-symmetric, their U(l) charges should satisfy Ci + C 2 + C 3 = 2. Let us note 
that, since the 7?-symmetry is a chiral symmetry, other terms in the Lagrangian, 
i.e. kinetic terms with covariant derivatives are automatically A-symmetric, though 
the gauge fermions transform non-trivially. 

We will assign the 17(1) charge c = 1 for all of quark and lepton chiral superfields, 
Q , u, d, L, l, and c = 0 for Higgs chiral superfields Hd and H\j. By taking a specific 
value ip — 7 T, we can define a discrete symmetry, “7?-parity”. The (overall) parity 


~Pufuc. 7/fafAewtafic.af PAy.lrc.J. 



178 


SUPERSYMMETRY 


of quark and lepton superfields is -1, while that of Higgs superfields is 1. Now the 
condition reduce, with p> = ir, to 

W' - W, Y, Ci=0 ( mod 2 )- ( 7 . 124 ) 

i 

It is now easy to see that the all terms in (7.118) do not satisfy the above condition, 
while the terms in (7.117) are all consitent with the condition. 

For each component field of the chiral multiplets the assigned U (1) phases for 
tp — 7r, i.e. 77-parity denoted by 77, is given by a compact form, 

77 = (-1) 2J (-1) C = (-1) 2J {-1) 3B+L , (7.125) 

where J is the spin of the field and B,L are baryon and lepton numbers, and 
377 + L = 1 for quarks and leptons and 0 for Higgs doublets. We thus easily know 
that all of the quarks, leptons and Higgs have 77 = 1 while their superpartnes all 
have 7? = — 1. In this way we have suceeded to distinguish ordinary particles and 
their superpartners. In addition, the absence of the undesirable terms in (7.118) 
is justified as the consequence of the 77-parity symmetry. We further note that, 
in general, hevier superpartners having odd 77-parity should decay into lighter su¬ 
perpartners and other ordinary particles, but not into a final state composed only 
of ordinary particles, which have even 77-parity. Namely the 77-parity invariance 
enforces the lightest supersymmetric particle (LSP) to be absolutely stable, thus 
being an interesting candidate of (cold) dark matter. 

Though we have implicitly assumed that the SUSY is an exact symmetry, actu¬ 
ally the SUSY should be broken spontaneously (see section 7.12) or explicitly (with 
“soft” terms, whose coefficients have positive mass dimensions, in order not to spoil 
the nice property of the quadratic divergence cancellation), since the superpartners 
of ordinary quarks and leptons having the same masses as those of the ordinary 
particles have not been discovered. The theory should also possess the spontaneous 
gauge symmetry breaking. These two kinds of symmetry breaking cause the mixings 
among various particles via their mass (-squared) terms, in various sectors, which 
makes the complete analysis of MSSM (Haber and Kane, 1985; Weinberg, 2000) 
rather cumbersome. 

We finally briefly comment on the specific feature of MSSM in its Higgs sector. 
After the elimination of the auxiliary fields Fh u d , the quadratic terms of Higgs 
bosons in the obtained scalar potential are \h\ 2 (H\)Hd + H^Hu). This in turn 
means that the spontaneous gauge symmetry breaking does not happen, as long as 
SUSY is preserved. An intriguing mechanism to achieve the negative mass-squared 
of Higgs boson, necessary for the symmetry breakdown, has been proposed (Inoue, 
et al., 1982), where the negative mass-squared is realized by the SUSY breaking 
and the quantum correction due to the loop of a heavy fermion with large Yukawa 
coupling, i.e. t quark. 

We also note that the (Higgs scalar) 4 term of the scalar potential is uniquely 
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provided by the D -term contribution, e.g. - HyHu) 2 which is obtained 

after the elimination of D-auxiliary field of 17(1) vector multiplet. Thus the coeffi¬ 
cient of the quartic term is completely determined by the gauge coupling constants 
and is of the order of g 2 or g' 2 . This means that the Higgs masses are comparable 
to Mz or Mw, provided SUSY is an exact symmetry. Though actually the SUSY 
breaking mass-squared terms for the Higgs bosons considerably modify the range 
of the Higgs masses, there should remain at least one light physical Higgs boson, 
whose mass < Mz- Intuitively, this is because at the limit of Msusy -> oo, the 
theory should reduce to the non-SUSY standard model, where one physical Higgs, 
whose mass is independent of Msusy, should exist. 


7.11 Some Phenomenological Predictions of SUSY Gauge Theories 

In this section we will discuss very briefly how we can search for the trace of SUY 
gauge theories for elementary particles and/or how we can test the characteristic 
phenomenological predictions of SUSY gauge theories. 

Very successful prediction of the SUSY version of the standard model, Minimal 
Supersymmetric Standard Model (MSSM), is the gauge coupling unification. In 
Grand Unified Theories (GUT), which unify all interactions (except gravity) of 
elementary particles, three gauge couplings of SU( 3) c , SU(2)l, U{1)y, denoted 
by gz,g 2 , 9 \ for suitably normalized gauge generators, should be unified at higher 
energies. Since these couplings are those for “strong”, “weak” interactions etc., 
such couplings do not seem to be unified at least at low energy processes. Actually 
the “asymptotic freedom” of non-Abelian gauge theories, and the “asymptotic non- 
free” nature of Abelian gauge theory, briefly discussed in chapter 5, implied by the 
renormalization group equations to discribe the energy evolution of gauge couplings, 
make the grand unification possible. Since the dependence of the gauge couplings 
on the energy is very mild, depending only logarithmically, the mass (energy) scale 
of the grand unification Mgut is quite high. In the prototype (non-SUSY) 517(5) 
GUT (Georgi and Glashow, 1974) Mgut = G(10 15 )(GeV). Such tremendously 
high mass scale makes the life time of proton decay, typically predicted by GUT, 
quite long, assuring the (approximate) stability of atoms. 

The simplest SU (5) GUT model, however, seems to be facing the following diffi¬ 
culties. (a) First, it predicts the proton life time rp ~ 3 x It) 31 (years), much longer 
the age of our universe. Nevertheless, the prediction has been ruled out by the 
recent (Super-) Kamiokande result on the decay mode p -> n° + e + . (b) Second, 
the precision measurement of the gauge couplings < 73 , 02,91 at E ~ 10 2 (GeV) at 
LEP experiments (CERN) has revealed the fact that these couplings do not meet 
with each other at an unique value of higher energy. It is quite impressive that 
such difficulties can be evaded (though there still may remain some problem) in 
SUSY SU(5) GUT (Sakai, 1981; Dimopoulos and Georgi, 1981), whose low-energy 





180 


SUPERSYMMETRY 


effective theory is MSSM. Namely in the SUSY GUT, the evolution of gauge cou¬ 
plings, or the ^-functions in the renormalization group equations are modified by 
the presence of the newly introduced superpartners of ordinary particles (mainly 
by the gauge fermions). If the SUSY breaking mass scale Msusy is in a reasonable 
range, Msusy ~ 1 (TeV), all of such modified three “running” couplings meet at a 
unique value of the energy ! Furthermore, the modification of the evolution raises 
the unification scale a little bit, Mgut — 10 16 (GeU). This little change, however, 
considerably raises the proton life time Tp (since rp oc Mq UT ), thus evading the 
lower bound on rp imposed by (Super-)Kamiokande experiment. It may be worth 
noting that the most recent result on the lower bound of rp from Super-Kamiokande 
experiment is quite close to the prediction of the SUSY GUT, even though they 
still seem to be mutually compatible. 

Probably the most direct confirmation of SUSY will be the discovery of the char¬ 
acteristic new heavy particles, i.e. the superpartners, in accelerator experiments, 
and extensive efforts have been made in such direction. Such superpartners should 
affect the low-energy physics as well, and their contributions may be (indirectly) 
tested by use of various low energy observables, whose probabilities have been mea¬ 
sured precisely. In particular the SUSY breaking masses of the order Msusy of 
these superpartners can be arbitrary a priori, as long as they are gauge invariant. 
Therefore they may potentially spoil the successful predictions of the (non-SUSY) 
standard model, unless there are some guiding principle to control them. 

For illustrative purpose, here we will briefly discuss the contributions of super¬ 
partners of quarks and leptons (squarks and sleptons) to the observables, which are 
rendered to be small by global symmetries, i.e. (1) A p and (2) Flavor Changing 
Neutral Current (FCNC). These are quantities discussed in detail in chapters 8 and 
9, respectively, and are handled by global symmetries, i.e. “custodial” and flavor 
(or horizontal) symmetries. We will see in chapters 8 and 9, that because of these 
global symmetries, these observables exactly vanish at the tree level. Though the 
observables are induced at the quantum (loop) level, they are automatically finite 
and are suppressed. Thus the smallness of these quantities are guaranteed without 
any tuning of parameters of the theory, the property we call “natural”. If SUSY is 
exact, i.e. if there is no SUSY breaking, the situation essentially does not change, 
and natural suppression of these observables are still operative. Once SUSY is 
broken, which is necessary anyway for the models to be realistic, the situation re¬ 
markably change, generally speaking. In particular, as we will argue below, FCNC 
may get into trouble, since the SUSY breaking mass terms, being flavor or gen¬ 
eration dependent, break the flavor symmetry. Thus unless there is some guiding 
principle to control the mass terms, the natural suppression of FCNC is spoilt, and 
it may cause a serious phenomenological problem of the model. 

(!) 

The p-parameter is defined in (4.105) and (8.5) as the ratio of charged and 
neutral weak gauge boson mass-squared, p = 1 at the tree or classical level because 
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of the SU(2)v (or SU(2)l) global “custodial” symmetry which remain in the Higgs 
sector even after the spontaneous symmetry breakdown (see 8.4). However, the 
symmetry is broken in the entire Lagrangian by e.g. the mass splitting between 
the members of SU(2)l doublet, such as ( t,b) l , the parameter deviates from 1 , 
i.e. A p = p — 1 7 ^ 0. The contribution of the heavy top quark to A p turns out to 
behave as -rk~. Then, one may naively expect that the superpartners, which are 

m w 

supposed to be heavy, ~ l(TeV’) or so, may strongly affect A p and SUSY gauge 
theories may contradict with the experimental upper bound on A p. Fortunately the 
SUSY breaking mass-squared given for the doublet of squarks, for instance (ti , d) 4 , 
does not break the global symmetry. Namely the SU{2)i gauge invariance makes 
the SUSY breaking mass-squared term for the squarks Mg USY {\u\ 2 + \d\ 2 ) invariant 
under the transformation of the custodial SU( 2 ) symmetry. Thus we expect the 
contribution of the squark doublet is under control. In fact we will see in (8.18) that 
the contribution is strongly suppressed by the factor [rn\ — m^) 2 /(M^Mg USY ) (the 
mixing between the superpartners of left-handed and right-handed quarks has been 
ignored for brevity). This kind of suppression is generally stated as the “decoupling” 
phenomena of heavy particles, which we will systematically analyze in section 8 . 2 . 
Thus the newly added contributions, characteristic to SUSY gauge models, are not 
troublesome concerning Ap, as long as Msusy is not too small. 

(2) FCNC 

We will discuss in section 9.2 that to guarantee the natural suppression of FCNC 
processes, whose rates are experimentally stringently bound and these processes are 
often called rare processes, fermions of the same electric charge and chirality should 
belong to the same representation of the gauge group (Glashow-Weinberg’s condi¬ 
tion). Since the SUSY generators commute with those of gauge symmetry, these 
two symmetries are mutually independent. Thus the superpartners should belong to 
the same representations as those of ordinary quarks and leptons, and the Glashow- 
Weinberg’s condition is still satisfied in SUSY extension of the gauge theories sat¬ 
isfying the condition. In fact, each of neutral currents coupled to Z weak gauge 
boson in quark and squark sectors, for instance, is flavor-diagonal and does not have 
FCNC. There appear, however, new type of neutral current in SUSY gauge theories, 
namely the (fermionic) current coupled with gauge fermion, which is bi-linear form 
of a fermion and its superpartner, of the type shown in Fig. 7.3(b). As we will see 
below if SUSY breaking mass-squared for superpartners are flavor-dependent, there 
arises a mismatching between the mass(-squared) matrices of fermions and super¬ 
partners, and when we move to their mass-eigenstates the neutral current coupled 
to the gauge fermion may have FCNC. This happens already at the SUSY QED 
discussed in 7.8 once the model discussed there is generalized so that it includes 
(three) generations of leptons. Thus leptonic FCNC process, for instance /t —t e-y, 
becomes possible via the exchange of the gauge fermion, photino A m, as shown in 
Fig. 7.4. 

We are now ready to confirm the above statement by a little explicit calculation 
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A m 



Fig. 7.4 


in the SUSY QED with three generations of leptons. The chiral multiples in (7.96) 
is generalized to include three generations, a = 1,2,3, as 

4>- a = (i aR ,l- R ,F aR ), 4>+a = (raL,li R ,K L ) (a = 1,2,3), (7.126) 

where l a , l a are current-eigenstates, the eigenstates of gauge interactions. The 
Lagrangian is the same as (7.98) except that the chiral multiples become triplicate 
and their superpotential is given by 

W = m^ a (t)- a (j>+0, (7.127) 

where mp a may be regarded as the (J3, a) elements of a mass matrix m. The mass 
terms of leptons l a and sleptons l a are easily obtained as the generalization of the 
mass and mass-squared terms for e and e given in (7.102) and (7.105): 

£ m = -{hml R + h.c.) - {P R {rrPm)l R + P L {mrrP)l L }, (7.128) 

where the lepton and slepton states are denoted by column vectors Il <r = 
^ 2 L,Ri Isl,rY and Il.r = {Iil,r, hL,R,hL,R) t ■ It is obvious that the mass 
matrix for lepton and mass-squared matrices for sleptons are diagonalized simulta¬ 
neously by the same unitary matrices Ul and Ur: 

U{mU R = m diag , (7.129) 

U' R (m'm)U R = m^ iag , U[{mrn))U L = m£ iagl (7.130) 

and the vectors of current-eigenstates are related to the vectors of mass-eigenstates 
lmL,R,imL,R by I l ,r = UL,.Rl m L,Ri h,R = Ui,. R lm,L,R- On the other hand the 
interaction due to the neutral current coupled to the photino Am is given in the 
base of current-eigenstates as 

Cnc = V2ie{P R \MlR + - h.c.}. (7.131) 

It is clear that even if we move to the mass-eigenstates l m L,R and l m L,R, the neutral 
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current is still flavor-diagonal and no FCNC is induced, i.e. 

Cnc = 'J2ie{i\ nR \Ml m R + P nL X MlmL - h.c.}, (7.132) 

since U R r Ul,r = 7 (7 is the 3x3 unit matrix). Namely all neutral currents have 
no FCNC at tree level. More intuitively we may say that the superpotential (7.127) 
itself can be diagonalized by Ul,r without changing the kinetic terms for chiral 
multiplets. 

The situation changes once we introduce SUSY breaking mass-squared matrices 
M'l R , only for sleptons (for brevity, possible mass-squared term to mix II and Ir 
is ignored), 

CsusYbr = ~{P r (M 2 r )Ir + i[(Ml)h}. (7.133) 

Suppose the SUSY breaking is caused by some interactions, which are indepen¬ 
dent of flavors, such as gauge interactions. Then the SUSY breaking mass-squared 
matrices do not break the flavor symmetry and L should be invariant under 
the unitary transformations by Ul,r■ This argument implies that M R L should be 
proportional to the unit matrix, 

M'l = M\ usyl ■ I, M r = M RUSYR ■ I, (7.134) 

where Msusy,l and Msusy,r denote the mass scales of the SUSY breaking. As 
far as the SUSY breaking does not break the flavor (or horizontal) symmetry, we 
expect that the same argument as that in the case of exact SUSY will holds. In 
fact, we readily realize that the slepton mass-squared terms, including the SUSY 
breaking terms, are simultaneously diagonalized as the diagonalization of the lepton 
mass matrix, 

U R (m^m + Ms USYR I)Ur = m di a g + M RUSYR I, (7.135) 

+ M rusyl I)Ur = m diag " l ~ Msus Y , L I- (7.136) 

Thus there does not appear FCNC in the neutral current coupled with the photino. 

In the MSSM, the SUSY extension of the standard model, there appears the 
flavor or generation mixings in the charged currents coupled with the superpartners 
of W ± as well. But such mixing is described by the same matrix as the KM matrix, 
as far as the SUSY breaking mass-squared matrices are proportional to I. We 
also learn from (7.135) and (7.136) that the mass-squared differences of squarks or 
sleptons in MSSM, which play essential roles in FCNC processes, are the same as 
those of quarks or leptons. For instance 

™ 2 c ~ ml = m 2 c - m 2 u . (7.137) 

Thus the rates of FCNC processes are suppressed by the small quark mass-squared 
differences and/or small flavor (generation) mixings, just as in the case of the stan- 
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dard model. This mechanism may be called as “super-GIM” mechanism (Barbieri 
and Gatto, 1982; Inami and Lim, 1982). 

We, however, have to be aware of the fact that a priori there is no mean to 
restrict the SUSY breaking mass-squared terms. In fact M“[ R may be arbitrary, 
without contradicting with the gauge symmetry. Hence, unless there is some good 
reason to guarantee the flavor symmetry in the SUSY breaking masses, the model 
will face a serious problem of too large event rates of FCNC processes, such as 
/r —► e 7 decay shown in Fig. 7.4, whose branching ratio should be extremely small, 
< 4.9 x 10 -11 even if the process ever exists. In such a sense, the study of FCNC 
processes may be quite crucial in order to select the mechanism of SUSY breaking, 
which strongly characterizes the SUSY theories. 


7.12 Spontaneous SUSY breaking 

We have seen in the argument of SUSY QED that the masses of electron and se- 
lectron should be the same provided the SUSY holds. Experimentally, however, no 
selectron with the mass m e has been observed. Thus SUSY should not be an exact 
symmetry in the nature and should be broken spontaneously or explicitly. Unfortu¬ 
nately, no successful spontaneous SUSY breaking in the sector containing ordinary 
quarks and leptons is known because of some phenomenological difficulty, though 
the spontaneous breaking is theoretically more appealing. Thus in the MSSM, for 
instance, explicit SUSY breaking scalar mass-squared terms, for instance, are added 
to the Lagrangian (by hand). We, however, discuss in this subsection the mecha¬ 
nisms of spontaneous SUSY breaking, since it is not only theoretically appealing, 
but also is expected to be the origin of the “explicit SUSY breaking”. In fact, it 
is argued that in the MSSM embedded into a supergravity theory, for instance, the 
explicit SUSY breaking terms are brought into the “observed” sector of quarks and 
leptons via supergravity interaction from some “Higgs sector”, which is decoupled 
from the observed sector once supergravity interaction is switched off and where 
the SUSY is spontaneously broken. 

First let us note that the vacuum energy, i.e. the vacuum expectation value 
(VEV) of the Hamiltonian E v — (0|7/j0) is the order parameter of the spontaneous 
SUSY breaking. This is easily seen from the relation in SUSY algebra, 

{QcnQa} = ~2 (a^ac, P^. (7.138) 

As only cr° has non-zero trace, Trer 0 = -2, we find 

{QuQi} + {$ 2 , 02 } = —2Trcr°P 0 = 4 H. (7.139) 

As the each term in the left hand side is semi-positive definite operator, we find 
E v = (0|i7|0) > 0. If E v vanishes we conclude SUSY is not spontaneously broken 
as Q q | 0) = <3a|0) = 0. Thus to break SUSY spontaneously, E v > 0 is necessary. 
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In fact if this is the case, under some SUSY transformation due to Q a or Q&, the 
vacuum state is not invariant. Thus 

Spontaneous SUSY breaking —> (0|ff|0) > 0. (7.140) 

Actually it is quite easy to realize a SUSY model with E v > 0. Let us imagine a 
Wess-Zumino model with a chiral superfield </> and a simple superpotential W (<f>) = 
c <p (c : constant). The resultant scalar potential is positive, V = |1 2 = |c| 2 > 
0, and SUSY is expected to be broken, formally speaking. It, however, does not lead 
to any mass splitting between the scalar A and the spinor ip components of <p. In 
fact, W\f = c F (F : auxiliary field), which has no self-interaction, nor mass terms 
for A and ip. Thus E v > 0 is necessary but not sufficient condition for “physical” 
SUSY breaking. (This is why the arrow in (7.140) has only one direction.) In the 
following we discuss possible mechanisms to get physical SUSY breaking. 

In general, scalar potential is the sum of the squared absolute values of auxiliary 
fields, 

V = \Fi\ 2 + (7.141) 

where the auxiliary fields are written in terms of scalar fields A{ as implied by the 
equations of motion, 

F i = D a — g A*(T a )iAj, (7.142) 

where g is the gauge coupling and T a are the gauge generators acting on A,. Thus 
to achieve spontaneous SUSY breaking with constant scalar vacuum expectation 
values, necessary for Lotentz invariance, E v = (0|U|0) = |(Fj)| 2 + |(D a ) 2 > 0 is 
needed ((F*) = (0|F*|0) = etc.). We therefore think of three possibilities 

to realize the spontaneous SUSY breaking: 

(a) (Fi) A 0, irrespectively of (D a ) 

( b ) (D a ) A 0, irrespectively of (Fi) (7.143) 

(c) (Fi) = 0 and (D a ) =0 do not hold simultaneously, 

where in the former two cases we do not have to care whether other types of auxil¬ 
iary fields develop vanishing VEV’s or not. The spontaneous SUSY breaking of type 
(a) and (c) are well-known and were proposed by O’Raifeartaigh (O’Raifeartaigh, 
1975) and Fayet-Iliopoulos (Fayet and Iliopoulos, 1974). The third type (b) (Fayet, 
1976; Inami, Lim and Sakai, 1983) is less-known, but will be discussed here, as it 
is one logical possibility and has a nice feature that SUSY is spontaneously broken 
purely due to the gauge interaction. 

(a) (Fi) A 0 (O’Raifeartaigh mechanism) 

We seek a model where (Fi) = 0 are not satisfied for all i, i.e. at least (Fi) A 0 
for some i, no matter (D a ) A 0 or not for some a. Hence we consider Wess-Zumino 
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model with several chiral superfields, without considering their gauge interaction. 
As we have pointed out above, ( F ) ^ 0 itself is easily realized only with 1 chiral 
superfield, though it does not lead to any physical SUSY breaking. O’Raifeartaigh 
found that to realize physically meaningful SUSY breaking at least three chiral 
superfields are necessary. Namely he found that a superpotential W for three chiral 
superfields <f>i (i = 1,2,3), 


W{4>) - - M 2 ) + 


yields the SUSY breaking, since not all of 


F* 


Fo 


dW(A) 

dAi 

dW(A) 

dA 2 

dW{A) 

dA 3 


= A(A 2 - M 2 ) = 0, 

= fiA 3 = 0, 

= 2AAiA 3 + /j,A 2 = 0, 


(7.144) 


(7.145) 


are satisfied for any VEV’s of Ai, 2 , 3 - Assume, for simplicity, that M, A, fi are 
all real and /x 2 > 2A 2 M 2 , then the minimum of the scalar potential V = A 2 |A 2 — 
M 2 | 2 + /x 2 |A 3 | 2 + \2\AiA 3 + fiA 2 1 2 is known to be achieved by (Ai) = (A 2 ) = 
(A 3 ) = 0. Thus ( V) = A 2 M 4 > 0, as expected. Furthermore, we can check 
that there appear mass splitting among the masses of scalars and fermions. Since 
all VEV’s of scalars are 0, the masses of fermions and scalars are easily read off 
from the quadratic terms in W and V, respectively: pip 2 i[> 3 for the fermion mass, 
-2A 2 M 2 Re(A|) + /x 2 (|A 2 | 2 + |A 3 | 2 ) for the scalar mass-squared. Namely the Weyl 
fermions ip 2 , ip 3 are combined to form a Dirac fermion 4/ = {%p 3 , -0 2 ) f with a mass 


= n 


(7.146) 


while the scalar masses are given as 

= 0 > = if, rrf ReA 3 = if - 2A 2 M 2 , m] mAz = if + 2A 2 M 2 . (7.147) 

Thus the masses of fermions and scalars are not identical any more, as the con¬ 
sequence of SUSY breaking. We, however, also note that, even though the mass 
splitting does occur, the average mass-squared of fermions and scalars are still 
identical. To be more strict, so-called super-trace of mass-squared matrix, i.e. 
StrM 2 = ^2j(— 1) 2J (2J + l)m 2 vanishes when SUSY is spontaneously broken. 
Here mj is the mass of the field with spin J field, where each of Majorana fermion 
or real scalar with mass m contributes to the sum as —2m 2 and m 2 . For instance, 
in the O’Raifeartaigh model we can explicitly confirm that StrM 2 = 2 x (0 + /x 2 ) + 
(if — 2A 2 M 2 ) + (if + 2A 2 M 2 ) - 2 x (2 x | + l)^x 2 = 0, where the multiplication 
factor 2 denotes that the corresponding masses are for complex scalars and a Dirac 
fermion. 
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It is exactly this fact of vanishing super-trace that causes the phenomenological 
difficulty when we attempt to construct a SUSY model where SUSY is spontaneously 
broken in the sector containing ordinary quarks and leptons, since it means that 
some of scalar partners of quarks or leptons should be lighter than corresponding 
quarks or leptons, which is not acceptable as we have not observed any of such 
superpartners. 

In the O’Raifeartaigh model, once gauge interaction is included, all of the su¬ 
perfields (£ 1 , 2,3 should be singlets under gauge transformation, for W to be gauge 
invariant, which may be not appealing if SUSY is to be relevant for high energy 
theory. Thus we next consider the case (c), which realizes the spontaneous SUSY 
breaking in a SUSY gauge theory. 

(c) { F%) = 0 and { D a ) — 0 do not hold simultaneously (Fayet-Iliopoulos 
mechanism) 

Fayet and Iliopoulos proposed a model where (Fi) = 0 and (D a ) = 0 are incom¬ 
patible, and therefore SUSY is spontaneously broken. Thus, in this model, SUSY 
is broken as the result of the interplay between the superpotential, i.e the self¬ 
interaction of chiral fields, and gauge interaction, and the mechanism is different 
from (b), discussed below, where SUSY is broken purely due to the gauge interaction. 

They essentially considered a SUSY QED discussed in section 7.8. They, how¬ 
ever, put an additional SUSY and gauge invariant term with a real parameter £, 

-£V\d = -£D, (7.148) 

which is called Fayet-Iliopoulos D-term. (Of course, in no-Abelian gauge theory 
this term is no longer gauge invariant and is forbidden.) 

The scalar potential for the selectrons e^.L is now modified from (7.105) into 

V = m 2 (|e R | 2 + |e t | 2 ) + |(e|e jR | 2 - e|e' L | 2 + £) 2 - (7.149) 

We immediately realize that if m = 0 or £ = 0, SUSY is not broken, since the 
vacuum states with E v = 0 are realized by the VEV’s |(£r)| 2 = |(ez,)| 2 — ^ or 
|(ej?)| = |(ei)| = 0, respectively. On the other hand if both of m and £ are non¬ 
vanishing, SUSY is spontaneously broken, since the two conditions 

\e R \ 2 + \e L \ 2 = 0, e|e H | 2 -e|ez,| 2 + £ = 0, (7.150) 


are not satisfied simultaneously. 

Depending on the sign of m 2 — e£ (assuming £ > 0), the minimum of the potential 
V is given by the following VEV’s 

(fr) = (e-L,) = 0, for m 2 - e£ > 0, 

(ep) = 0, ('Ll) = for m 2 - e£ < 0. (7.151) 
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Fig. 7.5 


In the former case, though SUSY is broken, the 1/(1) gauge symmetry is not spon¬ 
taneously broken with vanishing VEV’s, while in the latter case both symmetries 
are spontaneously broken (see Fig. 7.5). 

In the case m 2 — e£ > 0, for instance, the masses of electron and selectrons are 
given as 


m e = m, m\ R = m 2 + e£, m\ L — m 2 — e£. (7.152) 

Thus, again we get the remarkable relation 

StrM 2 — 2x (m 2 +e() + 2x (m 2 - ef) - 2 x (2 x i + l)m 2 = 0. (7.153) 

(b) (D a ) 7 ^ 0 (The third possibility) 

As the third possibility we consider a case where ( D a ) = 0 are not simultaneously 
satisfied by any choice of VEV’s of scalar fields, leading to E v > 0. Namely, in 
this case SUSY is spontaneously broken purely due to gauge interaction. For the 
transparency of the argument, here we neglect superpotential, since the presence is 
irrelevant for the mechanism to work. 

Ignoring the superpotential, the scalar potential, the values of VEV’s and the 
breaking of SUSY is completely determined by the choice of gauge group and the 
group representations of matter chiral superfields. One restriction is that the gauge 
group should have at least one 17(1) factor, since otherwise Fayet-Iliopoulos D-term 
is not allowed and supersymmetric vacuum state is easily realized for all vanishing 
VEV’s of scalar fields. 

The simplest model to realize the SUSY breaking turns out to be 5(7(2) x (7(1) 
SUSY gauge theory (Fayet, 1976; Inami, Lim and Sakai, 1983) with Fayet-Iliopoulos 
D-term —g£D (we denote gauge couplings of 5(7(2) and (7(1) by g and g) and two 
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chiral superfields 

cj> (Q=|), <£' (Q = -|), (Q : U(l) charge), (7.154) 

both belonging to the doublet representation of 5(7 (2). Let us note the assignment 
of the (7(1) charges Q = 1 / 2 , -1/2 is such that the triangle gauge anomaly disap¬ 
pears. By use of the equations of motion for D auxiliary fields, the auxiliary fields 
are written in terms of scalar fields, which we denote by the same letters <j) and <j>', 
as 

D a = g{4> ] T a <j> + ^TV) (T a = —) for 5(7(2) 

D = ~9(\<PU ~ \<P'W + 0 for (7(1). (7.155) 

By use of the 5(7(2) x (7(1) symmetry of the theory, we can assume the following 
form of the VEV’s of scalar fields, without loss of generality, 

<«=(!)• w>=("f) ■ < 7 - i56 > 

where x,y,z and 9 are all real. 

Then the conditions ( D a ) = 0 and ( D) = 0 read 

yz = 0 for (jD 1 ) = (D 2 ) = 0, 
x 2 -y 2 +z 2 = 0 for (D 3 ) = 0, (7.157) 

x 2 -y 2 - z 2 + 2i = 0 for (D) = 0. 

It is easy to check that for £ 7 ^ 0 these three conditions are not met simultaneously 
By any choice of x, y and z. Thus, we conclude that E v = ( V) = |(Z ? a ) 2 + |(77 ) 2 > 0 
and SUSY is spontaneously broken, irrespectively of the choice of the superpotential. 

This argument can be generalized to 5(7(n) x (7(1) SUSY gauge theories. It 
turns out that SUSY is spontaneously broken provided the number of pairs of 
chiral superfields belonging to n and n representations of SU(n) with opposite 
(7(1) charges is less than n (Inami, Lim and Sakai, 1983). The 5(7(2) x (7(1) theory 
with a single pair of chiral superfields just discussed above is the simplest example. 


Problems 

7.1 Prove the following relations concerning the 2-component spinors (anti-commuting 
Grassman numbers) 9 and 9 and the 2x2 matrices cr M and <r M . In 99, 9cr9 etc., 
spinor indices have been suppressed: 

a. 9 a 6< i = 99, 9 a 9 0 = \e a0 99, 
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8*60 = \e«m, he ft = ~\e^66 

(7.158) 

b. 

{et^e^ea^B) = 53 '“'( 00 ) (M) 

6d 

(7.159) 

c. 

( 0 ^l) ( 0 ^ 2 ) = -i(V’lV’ 2 )( 00 ) 

(7.160) 

d. 

det (<r M p M ) = det (5^p M ) = p^jf 

(7.161) 

e. 

(a ")* 0 = c^c“^K)^ 

(7.162) 

f- 

{a^a" + a"cr»} a 0 = 2g^S a 0 

(7.163) 

9- 

{d^a v + a v o»Y p = 2 Sf iv &* p 

(7.164) 

h. 

Tr (< 7 ^") = 2 

(7.165) 

i. 

= 2S a 0 8a 0 

(7.166) 

3- 

^a v a x - a x a v a M = - 2 it» vXK o K (e 0 i 2 3 = -e 0123 = 1 ) 

(7.167) 

k. 

a*a"a X - <r Vct" = 2 

(7.168) 

1. 

+ cr Vo* = - 2 (g^ x c u - g vX a^ - g^a x ), 

(7.169) 

m. 

<j' 1 a v <T X + a -= - 2 {jf x a v - g^a* - g» v <T X ). 

(7.170) 

7.2 Verify the relation in (7.31), 


G(0,e,e) 

■ G(x ", 6, d) = G(^ + ida^e - *«o^0,0 + e, 0 + e). 

(7.171) 


7.3 Find the propagator of the auxiliary field F of a chiral superfield (consulting, 
for instance, with the Lagrangian given in (7.61)). Apply the propagator to the 
calculation of the diagram in the left of Fig. 7.2(a), and show that the coupling 
-4 ig 2 should be assigned for the vertex of the diagram in the right of Fig. 7.2(a). 

7.4 Show that in the simplified Wess-Zumino model with only one chiral superfield 

</>, whose Lagrangian is given in (7.66), there is an exact cancellation of the quadratic 
divergence between the diagrams Fig. 7.2(a) and (b). Show also that the another 
type of diagrams with the loop of fermionic field, coming form the contraction of 
two identical vertices, e.g. AiJim does not yield the quadratic divergence, 

even for m/ 0 . 

7.5 Verify (7.87) and (7.88). 
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Chapter 8 

PRECISION TEST OF ELECTROWEAK 
RADIATIVE CORRECTIONS AND 
NEW PHYSICS 


8.1 The Meaning of Precision Test of Electroweak Radiative Cor¬ 
rections 

The standard model of elementary particles, especially its electroweak sector, was 
devised as a renormalizable gauge theory with massive vector bosons such as Z° 
and W ± . Already at the classical (tree) level, without quantum corrections, the 
standard model has its characteristic properties and such properties have been ex¬ 
tensively tested experimentally. We may pick up some typical examples of such 
tests: discovery of the predicted neutral current processes, confirmation of the ex¬ 
istence of Z° and W ± with their predicted masses, forward-backward asymmetries 
Afb in the scattering e + e~ ->//(/: fi, etc.). 

The genuine features of the theory as the renormalizable theory, however, should 
be studied by comparing the calculated (finite but small) quantum effects of the 
theory on the physical observables, i.e. radiative corrections, with the data obtained 
in the precision experiments, such as LEP, CDF, etc. 

Let us recall how the renormalization procedure goes, namely how the finite 
(calculable) radiative corrections are obtainable in the renormalizable theory. To 
avoid unnecessary complication, let us focus on the gauge sector, namely the part 
of the Lagrangian, which contains gauge bosons. Then we have 3 bare parameters: 
2 gauge couplings g, g' of SU(2)l and U (l)y and the vacuum expectation value 
(VEV) of the neutral Higgs boson, v. What we should do first is to calculate 
observables, which have been excellently well-measured experimentally, i.e. 

M z = 91.150(30) GeV (from LEP, SLC), (8.1) 

G f = 1.16637(2) x 10~ 5 (GeV) -2 (from ft -» ep^v e ), (8.2) 

a = 137.0359895(61) -1 (from g — 2 of electron), (8.3) 

including quantum effects in terms of the 3 bare parameters. Namely, we calculate 
these quantities as the functions of 3 bare parameters and other parameters, such 
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as m t , mu'- 


Mz(g,g' ,v,m t ,mH,- ■ ■), GF{g,g',v-,m t ,mH,---), a(g,g', v;m t ,mH, •• •), (8-4) 


which at the first sight have UV (ultra-violet) divergences, but actually are finite 
“renormalized” quantities. (Or we may say that the bare parameters themselves 
are UV divergent.) Equating these 3 functions with the measured values listed 
above provides formulae, which enables us to express the bare parameters in terms 
of observables Mz etc.: g = <j>(M z , Gf, a; mt, run, • • •)> etc. Then, we may cal¬ 
culate any other observables in terms of bare parameters, /(< 7 , </, u; m t , m#, • • ■), 
which are superficially divergent. Once the functions <j>(Mz,---) are substituted 
for the bare parameters g etc., they become the functions of well-measured quan¬ 
tities: f(g,g',v;mt,mH,---) = f(M z ,GF,or,mt,rriH,- ■ ■)• The obtained quantities 
/ should be all finite, as long as the theory is renormalizable. These are real pre¬ 
dictions of the renormalizable theory. 

The above argument on the quantum effects is easily understood, relying on 
(local gauge invariant) operators. As is well-known in 4 space-time dimension, a 
renormalizable theory should contain only marginal or relevant operators with mass 
dimension d = 4 or d < 4 to start with. After including quantum effects, there also 
appear irrelevant operators with d > 4, together with the marginal and relevant 
operators. The marginal and relevant operators get ultra-violet(UV) divergent cor¬ 
rections on the coefficients of their operators, such as A 2 or In (^), with A and 
p being momentum cutoff and the renormalization scale. (The momentum cutoff 
potentially break local gauge invariance and usually alternative regularization meth¬ 
ods such as dimensional regularization may be used.) The quantum corrections to 
g and g’ and the corrections to gauge boson mass-squared Mf, M^,, keeping the 
relation of bare masses, 


P = 


M 2 W 

cos 2 6w 



(8.5) 


belong to this category. On the other hand the coefficients of irrelevant operators, 
induced by the quantum effects, should be automatically finite; otherwise the UV- 
divergences cannot be removed, since there are no irrelevant operators nor countert¬ 
erms in the original Lagrangian. The physical observable A p = p -1 or equivalently 
Mfcos 2 # - Mfy, caused, e.g., by a mass splitting between the members of SU(2 )l 
doublet, is a typical example. Apparently, it seems that the radiatively induced A p 
is UV-divergent, as the gauge boson mass-squared seems to be the coefficient of a 
d = 2 relevant operator. This apparent contradiction is resolved once we invoke to 
the language of (local) gauge invariant operators with Higgs field included. As far 
as the theory to start with is gauge invariant, every operators induced or corrected 
by quantum or radiative effects should be all gauge invariant. This is, say, a picture 
in the “symmetric phase” of the theory. Of course the gauge boson mass-squared 
stems from the spontaneous symmetry breaking (SSB), i.e. in the “broken phase” 
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of the theory. Thus the mass-squared (difference) relevant for A p should be under¬ 
stood as the result of the replacement of the (neutral) Higgs field by its VEV in 
an irrelevant operator composed of Higgs field, as well as gauge bosons. In fact we 
will see in the later section that A p can be regarded as the coefficient of a d = 6 
operator. This is why the radiatively induced A p is finite. 

The precision tests of such finite radiative “corrections” (quantum effects) to 
electroweak parameters, such as gauge couplings and gauge boson masses, are quite 
important to check the validity of the standard model, since these finite effects are 
genuine predictions of the model. For instance such tests are performed by the 
precision measurements of various parity violating asymmetries such as forward- 
backward asymmetry or LR asymmetry in e + e _ scattering (LEP etc.). The heavy 
particles of the SM, like t quark and Higgs, and still unknown heavy particles 
predicted by theories beyond the standard model (“New Physics”) do not directly 
appear in the external lines of Feynman diagrams of lower energy processes. They 
appear only or mainly through their contributions to gauge boson self-energies, 
i.e. through indirect ’’oblique corrections”. Strictly speaking, e.g. the t quark 
may contribute to bbZ vertex, and these extra contributions have to be estimated 
depending on each case. It, however, is still true that the oblique corrections exist 
universally when we consider the radiative corrections of heavy particles and are 
worth general investigations. 

As the coefficients of irrelevant operators have negative mass dimensions (d < 
0 ), we may expect that the finite quantum effects due to heavy particles may be 
suppressed by the inverse powers of the heavy particle masses (“decoupling” of 
heavy particles in low energy processes). But it turns out that this is not always 
true, and some radiative corrections are not suppressed by the inverse powers. Such 
“non-decoupling” effects in oblique corrections potentially appear only in restricted 
number of ’’oblique parameters”, S, T and U, which parameterize electro-weak 
radiative corrections. The precision tests of the 3 parameters 5, T , U are quite 
important not only to get useful information of the heavy particles in the standard 
model, m t and mn, but also to test various theories of New Physics. In fact, 
(original version of) “technicolor” theory was ruled out as the result of the precision 
test of the 5-parameter (Peskin and Takeuchi, 1990). 

The purpose of this chapter is to discuss the precision test of the radiative 
corrections to the electro-weak parameters in some detail, not only due to the heavy 
(comparable or greater than Mw) t quark or Higgs in the standard model, but also 
due to the heavy particles predicted by various New Physics. We, however, assume 
that the gauge group we take is the same as the standard model, i.e. SU(2)l x 
U(l) y . 


~Pu.*uc. 



194 


PRECISION TEST OF ELECTROWEAK RADIATIVE CORRECTIONS ... 


8.2 Decoupling and Non-decoupling 

As we discussed above, precision test of the radiative corrections due to heavy 
particles provides useful information on the properties of heavy particles such as 
their masses, or even rules out their existence. The prediction of c quark mass by 
Gaillard and Lee (Gaillard and Lee, 1974) before its discovery, utilizing the data 
on the flavor changing neutral current processes of neutral kaons K Q , K°, is the 
remarkable example. 

Heavy particles are heavy in the sense that the processes we treat are those 
for lighter particles at lower energies. That is why to get information on their 
masses indirectly through radiative corrections is meaningful. Heavy particles, even 
though they cannot appear as the real states in the lower energy processes, may 
participate in the processes as the virtual or intermediate states. We, however, 
naively anticipate that the effects are suppressed by the inverse powers of the heavy 
particle masses, which we generically denote by M, since the propagators of heavy 
particles behave as (for scalars) 

A : 2 -M 2 ~ ^ 

in lower (|fc 2 | <C M 2 ) energies. At the tree level, this replacement is justified. In 
the loop diagrams, however, the replacement is no longer justified, since the 4- 
momentum varies as the loop momentum in the integration. Thus whether the 
heavy particle contributions are really suppressed by the inverse powers of their 
masses or not is a non-trivial question to be addressed. 

In gauge theories without SSB, having only parity preserving vector-like (non- 
chiral) interactions, such as QED or QCD, so-called “decoupling theorem” holds 
(Appelquist and Carrazone, 1975): the contributions of heavy particles with mass 
M to physical observables are suppressed by the powers of 1/M. Let us take the case 
of QED as an example. Suppose we have a heavy charged lepton E~ with the mass 
M, say a partner of ordinary electron belonging to hypothetical heavy generation. 
Let us consider its contributions as virtual states in the processes described by 
Feynman diagrams with n external photon lines (see Fig. 8.1). 

The quantum effects provide effective Lagrangian for external photon fields 

C e/ f = Y, C <( M ) ( 8 ' ? ) 

i 

where Oi denote local gauge invariant operators with n photon fields and Ci are 

2 

their coefficient functions, which depend M in addition to a = as the result of 
loop integration of internal lines. Note that the mass dimension of the operators 
Oi are greater than n, di > n, in general. This is because to make gauge invariant 
operators, we need field strength F^„ rather than itself, which has derivatives 
of A thus making the dimension of the operators higher than n. Thus, as is clear 
from dimensional analysis, for n > 4 the coefficient functions are suppressed by 
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7 



Fig. 8.1 


inverse powers of M, 1/M di ” 4 (d* > n > 4). What all of these are saying is that 
the coefficients of irrelevant operators are suppressed by the inverse powers of M. 

This kind of dimensional analysis no longer holds for the case of marginal op¬ 
erator, which appears in the case of n = 2, i.e. self-energy diagram. This just 
corresponds to the correction to (We also get operators with higher 

derivatives, which we ignore as they are contained in the category of irrelevant 
operators.) The coefficient is dimensionless and behave as f* dt In (), 
with A, t being momentum cutoff and Feynman parameter. Obviously, this co¬ 
efficient is not suppressed by large M. This correction, however, is not a pre¬ 
diction of the theory as it is UV-divergent. After imposing a renormilization 
condition that the correction disappear at - q 2 = p 2 , which is met by adding a 
counterterm made from bare parameters, the correction is rendered to be finite: 
fo dt ln ( ~ (~Q 2 P 2 ), which may be approximated at low energies 

p 2 , \q 2 \ < M 2 to be, - fg dt t(l - t) a ~^§~ = m* ' < 1- Hence, again the net 
effect of the heavy particle is suppressed by an inverse power of M, though it does 
contribute to the renormalization of the bare parameters. These are contents of the 
“decoupling theorem”. 

This situation is psychologically good, since when we calculate some physical 
quantities just relying on the standard model we do not have to worry about what 
really is the theory in higher energies where unknown heavy particles are supposed 
to exist: these heavy particles may give some extra contributions but they are 
safely small. However, it also says that we cannot have significant information con¬ 
cerning the properties, such as masses, of the heavy particles or concerning New 
Physics. Fortunately, in chiral theories with SSB like the standard model, some 
“non-decoupling” effects are known to exist. As the above argument of decoupling 
theorem is a convincing one, one may wonder what is new in such theories. The 
key ingredient, which leads to the non-decoupling, is the fact that now (at least 
some of) the coupling constants are proportional to masses of the relevant par¬ 
ticles. More precisely, for instance in the standard model, all massive particles 
(including Higgs itself) get their masses only through spontaneous gauge symmetry 
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Fig. 8.2 


breaking (SSB), i.e. through their couplings with the Higgs field, which develops 
VEV. Thus when some particle is heavy that means its coupling with the Higgs, 
such as Yukawa couplings and Higgs self-coupling, are strong. (Hereafter we will 
assume that the couplings are not extremely strong, not to spoil the perturbative 
expansion.) For instance the Yukawa coupling of the top quark f t is proportional to 
Pit, ft — m t /(v/\/2) = Such couplings appear in the vertices of Feynman 

diagrams and therefore in the numerators of transition amplitudes of the processes 
we consider, and the above argument on the decoupling theorem, solely relying 
on dimensional analysis, is no longer valid. We thus anticipate that in some suit¬ 
ably chosen processes, heavy particle contributions may be enhanced by (positive) 
powers of M. 

We will list up some known examples of such non-decoupling effects of heavy 
particles in the following. 


(a) Non-linear sigma model 

The non-decoupling effect is obtainable already at tree level. This seems to 
contradict with our intuition that the propagator of heavy particles are suppressed 
by 1 /M 2 as we have discussed above. Concerning the Higgs self-interaction in the 
standard model, however, this naive argument no longer holds. Let us consider 
4-point scattering amplitude of charged Higgs (p + (“would-be Goldstone” particle) 
as shown in Fig. 8.2: 

In Fig. 8.2(a), the exchange of the “physical” neutral Higgs H yields a propa¬ 
gator i/(q 2 - Mjj) ~ -i(l/Mfj)-i(q 2 /Mfj), which is suppressed by On the 

other hand the vertices are both enhanced by an factor proportional to Mjj, i.e. by 
—ig 2 M i w ■ Thus this Higgs-exchange diagram provides an effective Lagrangian (in 
momentum space) 


1 

—i x 


2 q 2 - M 2 H 


(~ig 


-) 2 (ip + v~) 2 ~ — (——+ -^-) (<p + ip~) 2 . (8.8) 

2M 2 / y ’ 8 Mjj 1 M$v ^ V K ’ 


The first term without q 2 implies we have effectively got a contact interaction en¬ 
hanced as Mjj. Actually this term is exactly cancelled by the original quartic 
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interaction, shown in the second diagram Fig. 8.2(b), - 9 ^ a (<p + tp~) 2 . Thus in 
the large Mh or equivalently low energy (\q 2 \ -C Mj f ) limit, we have only derivative 
interaction — ■ - ^ 2 (<p + <p~) 0(tp + (p~). This term is independent of M# and indi¬ 
cates a non-decoupling effect of heavy Higgs. We may attach the lines of <p + , <p“ to 
the internal line of H as many times as we want to get higher dimensional operators 
of the fields <p. Such operators turn out to be all derivative interactions and to be 
summarized in a compact form ( v/\/2 being the VEV of the Higgs) 

~ Tr (d»U WU), U = e iS2 ^, (G + = = -i V +,G 3 = \/2Im<p 0 ). 

4 \/2 

(8.9) 

This Lagrangian contain an infinite number of non-linear terms, and corresponds to 
a description of the Higgs doublet (f> in terms of “polar coordinates system” instead 
of ordinary Cartesian coordinates: 

$ = (#) = v -±^- U. (8.10) 

This non-linear realization of the Higgs field is the counterpart of non-linear sigma 
model in QCD, where tp 1 should be understood as 3 pion fields 7 r* (i = 1,2,3), which 
are only particles appearing in the (very) low energy regime of QCD. It is now clear 
why the contact interaction term disappeared: in the polar coordinate system the 
Higgs potential is independent of <p*, as U^U = I (I : unit matrix). 


(b) A p 

We have defined the p-parameter as the ratio of charged gauge boson mass- 
squared to that of the neutral gauge boson (with an extra factor cos 2 9w), which is 
exactly 1 at the tree or classical level. The parameter, however, deviates from 1, i.e. 
Ap = p — 1 7 ^ 0, once the quantum correction is taken into account. Let us consider 
the quantum correction due to the SU(2)l doublet (£, b) 1 . If there ever exists 
extra generation of quarks and leptons, the doublet may be replaced by the doublet 
of fourth generation quarks (£', b') 1 as well. Since W$ = cos 9w Z ,L -I- sin 
and the photon field never gets mass correction due to the unbroken U (l) em 
symmetry, the quantum correction to cos 2 9w is equivalent to the correction 

to the operator Ws^Wif, with 3 denoting the third component of 51/(2) adjoint 
representation. So at 1-loop level, namely as far as 0(a) corrections (denoted by 
the quantities accompanied by 5) are concerned, 




M& + 8M*, + _ 1 SM^+SM^ 

M^ + SM 2 ,, ~ Mjy 2 


~sm 2 W3 ), 


( 8 . 11 ) 


where we have replaced the mass of the charged gauge boson by Mw 12 : 
M'( v+ W + ^W~^ = ^ j s graphically expressed in Fig. 8.3, 

where II al >,(0) (a, b = 1 — 3) denote the scalar part of the vacuum polarization tensors 
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A p-M^ 2 ( + W2 VF2 j 

6,t 6,t ' 

~ 2 ( ^11(0) + n 22 (o)) — 1133(0) 

Fig. 8.3 


t, b 

W3 wQ. py 3 
b,t 


of W afJ , Wh u 2 point functions (after factoring out p M „) at q 2 = 0. Let us note that 
the scalar parts at 0 momentum correspond to the quantum corrections to gauge 
boson mass-squared. 

The contributions of the ( t , b ) 1 doublet is readily obtained by modifying the 
original calculation by Veltman (Veltman, 1977) for the contribution of heavy lepton 
doublet to be 


A p = 


3 a 


1 


167rsin 2 Qw 


( m t + m 2 b - 


2 m?m|_ m 2 t 

„2 in /> 


i w m-t ~ "% "% 

where the factor 3 comes from the colors of quarks. One may check that this 
expression vanishes in the limit of “degenerate doublet” m t = mj. (The factor 

1 m 2 

m a_ t a In is not singular in this limit, but just yields a derivative of the log- 


m 


( 8 . 12 ) 


arithmic function, 1/m 2 . If we take the limit of mb mt, as is realized in the 
nature, we get 


A p 


3a 


mf 


167rsin 2 dw Myy ’ 


(8.13) 


which clearly shows the enhancement of the quantum effect of heavy top quark, 
proportional to m 2 , i.e. a non-decoupling phenomenon. 


(c) Flavor Changing Neutral Current (FCNC) Processes 

As we have already mentioned, Gaillard and Lee could predict the c quark 
mass before its discovery (Gaillard and Lee, 1974), studying the quantum effects 
of the c quark in low energy FCNC processes of neutral kaons K°, K°, such as 
K° K° mixing. As the neutral kaons are the bound states of d and s quarks, 
K° ~ s'ysd, K° ~ fly 5 s , this mixing is caused by an elementary process sd o sd. 
In this transition, flavor quantum number “strangeness” S changes by two units, 
|AS| = 2, while there is no change of the electric charge. This is why such processes 
are called Flavor Changing Neutral Current (FCNC) processes. We will discuss 
these FCNC processes in some detail in the next chapter. As we discuss there, since 
there is no FCNC interaction at the tree level of the standard model, the sd Ct sd 
transition is induced by the loop diagram, called “box diagram” (Fig. 8.4). 

When Gaillard and Lee discussed the process the third generation had not been 
confirmed. But now its existence is known and it is an important question to ask if 
the heavy t quark gives a non-decoupling quantum effect to this process, thus making 
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U,C,t 



Fig. 8.4 


it meaningful to estimate m t or its flavor mixings with lower generation quarks by 
studying this K° •<-> K° mixing, just following the philosophy of Gaillard and Lee. 
For such purpose it is necessary, since to* > Mw, to calculate the box diagram 
exactly without assuming the internal quark mass is much smaller than Mw, which 
was adopted by Gaillard-Lee’s calculation in the view of the fact m c <C Mw- The 
result of such exact calculation (Inami and Lim, 1981) is written in the form of 
4-Fermi effective Lagrangian (see (9.11)) 


4/f~ 2 = T7 ^ GF 2 a £ (V*V id )(V; s V jd ) Eix^xj) (s^Ld)(s^Ld), (8.14) 
4v i7TSin Ow 


i,j=c,t 


where L = , V ls etc. are KM matrix elements and the coefficient function 

E(xi,Xj) denotes the contributions of internal up-type quarks with masses m,, rtij 
m 2 

(Xi = The (pure) t quark contribution reads as, 


E(x t ) = E(x t ,x t ) 


-®( 

2 y x t 


xt 


T ) s in - [j - “ I (T ^? 1 *> < 8 - 15 > 


For the imaginative limit of x t > 1, i.e. m'j » M^ (though it is not so bad 
approximation), the function behaves as 


E{x t ) ~ - 


4 


1 _m|_ 
4 


(8.16) 


which behaves as mf and is another example of the non-decoupling effect of the 
heavy fermion. It is worth noticing that for c quark, with x c < 1, E(x c ) ~ -x c , 
which just recovers the result of Gaillard-Lee. The calculation (Inami and Lim, 
1981) also shows that similar enhancement proportional to rnf appears in the in¬ 
duced FCNC vertex sdZ (“Z-penguin”), but not in sdj vertex (ordinary “penguin”), 
where we have only logarithmic enhancement In due to the property of the 
CVC. As far as we know the heavy fermion contributions to A p and these FCNC 
processes are only known examples where we have an quadratically growing non¬ 
decoupling effects. Interestingly, it turns out that (at least at 1-loop level) the 
contribution of a heavy Higgs to A p does not grow as Mf {) but grows only as 
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We have seen that the interesting non-decoupling effects arise in chiral theories 
with SSB. Even in such chiral theories, however, heavy particles effects may be 
decoupling. It is the case when heavy particle mass is not provided by SSB, i.e. 
not through large couplings with Higgs, but by a new large mass scale, independent 
of Mw, characterizing some New Physics. Such new mass scales are inevitably 
invariant concerning the gauge symmetry of the standard model, since otherwise 
it will break the gauge symmetry at much higher energies than Mw, which con¬ 
tradicts with reality. In such a sense, the situation is similar to the case of the 
decoupling phenomena we discussed above about a heavy lepton E~, whose large 
mass was also gauge invariant and came to the denominator of the coefficients of the 
effective operators. We will list up some typical examples of such decoupling effects. 

(d) Seesaw mechanism 

In Chapter 6, we discussed the seesaw mechanism. The seesaw mechanism is 

known to be a scenario which naturally lead to light Majorana neutrinos. As we 

2 

have already seen in 6.2, the small Majorana mass with mo ,ma being a 
Dirac mass and the Majorana mass for vr, is induced through a tree level dia¬ 
gram in Fig. 6.1. The fact that the mass is suppressed by 1/rriR can be under¬ 
stood as to indicate the decoupling effect of the intermediate state vr. Let us 
note that 771 r(» Mw) is SU(2)l x U(1)y invariant mass, which may be related 
with the new mass scale of, say, left-right symmetric model of electro-weak interac¬ 
tion SU(2)i x SU(2)r x U(1)b-l (Mohapatra and Senjanovic, 1980) ( B , L denote 
baryon and lepton numbers), where SU{2)r symmetry and therefore parity is spon¬ 
taneously broken at the scale 0(ttir). 


(e) The contribution of super-partners to A p 

As was discussed in Chapter 7, supersymmetric theories predict the presence of 
super-partners of ordinary particles. Here let us consider the contribution of the 
SU{2)l doublet ( u,d )*, the superpartners of light generation quarks (it, d) 1 . As the 
partners have not been observed in, e.g. e + e“ collider experiments, these partners 
should be heavy: m? = m 2 + Mg USY , m 2 - = rn 2 d + Mg USY , where Msusy is 
“SUSY breaking mass scale”, denoting the extent in which SUSY is broken. (We 
have ignored the effect of possible mixing between the super-partners of left- and 
right-handed quarks etc., to avoid unnecessary complication.) These partners are 
heavy not because the Yukawa couplings are large ( m Ut d Mw), but because the 
gauge invariant Msusy is large. We thus anticipate the decoupling of these partners 
in their quantum effects. Their contribution to A p is given as (Alvarez, Gaume, 
Polchinski and Wise, 1983; Barbieri and Maiani, 1983; Lim, Inami and Sakai, 1984) 


A p = 


3a 


167rsin 2 6w M^y 


1,2 2 2m i m i , 

(ml + mr d -5-2 l n — 2 )■ 

d ml - ml ml 

u d a 


(8.17) 


Superficially, the form of this equation is the same as that in (8.12) for ( t, b) 1 doublet 
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Fig. 8.5 


contribution, and the decoupling of super-partners does not seem to be the case. 
As the matter of fact, the decoupling holds: Since m Ui <j C Msusy, we can expand 

m 2 

the formula for Ap in terms of powers of to get 


A p ~ 


a 

167rsin 2 0\v 


K-™2) 2 

M\v Msusy 


(8.18) 


which shows the suppression by -rrr — — — 5 — and therefore the decoupling of 

m SUSY m ii,d 

super-partners. The factor (m 2 - m 2 d ) 2 indicates that A p is described by an ir¬ 
relevant operator which is quartic in the Higgs field, as we will see later in this 
chapter. 


8.3 Oblique Corrections and S, T, U Parameters 

As has been already stated in 6.1, heavy particles of the standard model, t quark 
and Higgs, and heavy particles predicted by New Physics do not directly appear 
in the external lines of Feynman diagrams for the lower energy processes of light 
particles. They appear only or at least mainly through their contributions to gauge 
boson self-energies, so called “oblique corrections”. Up to the order of 0(a 2 ), or up 
to the 1 -loop order, the oblique corrections to the process // —> /'/' are shown in 
Fig. 7.5, where the blob denote the radiative corrections to gauge boson self-energies 
due to the heavy particles. 

Such oblique corrections are known to be conveniently incorporated by “star 
prescription” (Kennedy and Lynn, 1989), namely by replacing the bare quantities 
at tree level amplitude e, s 2 (= sin 2 #w), etc. by corresponding “star” quantities 
e*(q 2 ), S K<1 2 )> etc., which incorporate quantum corrections as well and depend on 
q 2 , with being the momentum of intermediate gauge bosons. More explicitly, 


~Pu.U.c. Tfeai/tQJHaftgqf 
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the effective 4-Fermi Lagrangian for the process (in momentum space) is written as 

(c* = 1 - s*) 

Ceff = elQQ' ± (/V/') 

+ A iH[hL - slQ)f) (f'^WL - slQ'}f), (8-19) 

where Q,Q' and I3 , I 3 are electric charges (in the unit of e) and the third components 
of weak isospin of fermions / and respectively. 

Let us demonstrate this and find the expression of the star quantities in terms 
of the vacuum polarization functions n(g 2 ). We start from writing the radiative 
corrections to gauge boson self-energies denoted by blobs in Fig. 8.5, in the form 
of induced quadratic terms for gauge bosons in the effective Lagrangian, in terms 
of the vacuum polarization functions II(g 2 ): 


\ + i n zz9^Z v + + U ww g tll/ W +ti W-', (8.20) 


where we have included the correction to charged gauge boson self-energy Tlww, for 
later use. We have ignored the part proportional to q^q v in the vacuum polarization 
tensors, since in the scattering amplitude this part provides relatively negligible 
terms which are at most proportional to external light fermion masses after the 
usage of equations of motion, such as < 7 ^/ 7 ^ 75 / = 2m/ / 7 s/. At the order of 0(a 2 ) 
or at the 1-loop level, the effective 4-Fermi Lagrangian relevant for the scattering 
is given as 


£e// 


x 


rss 


x 


( e Qfltif , f s flti{hL - s 2 Q)f ) 

( q 2 — n 77 -n 27 \ -1 / eQ'rrf 

V -n Zl q 2 — Mq — TL ZZ J \f s f'Y{I' z L~s 2 Q’)f' 

( eQfluf ! t s fln{hL-s 2 Q)f ) 


n'. 


9 2 (i-n; 7 ) 

n Zy 


zg _ 
SM* 


q 2 -M'*-n Z z 

<Qfitif) ^(Q'f'itif) 


eQ'f'rf 

fJ'Yil'zL - s 2 Q')f 


i n 77 


+ -fW‘7 AhL - (s 2 - csn Zl )Q)f) 

c s 

x q 2 _ M l _ Uz - (/' lM L - (« 2 - csn Z7 )Q']/'). (8.21) 

where II 77 = q 2 II 77 , Hzz = q 2 II ' zz , and is the bare Z boson mass- 

squared. In the last step, we have used the fact that the off-diagonal element of the 

n' n' 

matrix for propagators, J , may be replaced by n 2 _ A Vf 2 ri — at the 1 -loop level, 


i clclL Pfcyiiei. 
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and can be combined with the contribution of diagonal element accompanied by 
Uzz ■ The key ingredient here is that not only II 77 but also Hz 7 is proportional to 
q 2 because of the CVC of QED. Namely in the diagram with 7 — Z mixing in Fig. 
8 . 6 , the factor of photon propagator 1 /q 2 is cancelled by the factor q 2 in Uz 7 , thus 
making the photon propagator shrunk, yielding a amplitude exactly of the same 
form as that of (the vector coupling part of) pure Z-exchange diagram (see Fig. 
8 . 6 ). 

The factor of Z-propagator q2 _ M l_ n2 ,~ should have a pole at q 2 = Mz, with 
Mz being the well-determined physical Z boson mass: — Mg — Uzz{M z ) = 0, 

which may be used to eliminate the bare mass Mg from our final result. Namely, 
the Taylor expansion of II zz(q 2 ) around M§ gives an expression in terms of the 
physical mass: q 2 - M% - II zz(q 2 ) = q 2 ~ M§ - (q 2 - M|)^ z -|, 2 =M | - n res - 
(1 ~ *^^I< 7 2 =m|)(<? 2 ~ M z ~ fires)> where n re s denotes the part 0((q 2 - Mf) 2 ): 
Tires = Hzz(q 2 )-U Z z(M 2 )-(q 2 -M 2 )^\ q2=M 2 . The factor (1 - ^|, 2=M| ) 
is the residue at the pole of the propagator. 

Now the expression (8.21) takes the same form as (8.19), and we can easily read 
off the star-quantities: 


3 = Tj(i-nU) 


47ra 

" 2 - s 2 - cs n' Z7 , 


{1 - P' (g 2 ) - n; 7 (o)]} 


Si = 


(zb*) 

(tS) 


(1 + 


dHzz | 
dq 2 


2 =M 2 ) 


~ 1 + 


dU zz | 

dq 2 


l 2 =. M % H77 


c 2 - s 2 


cs 


n'z 7 , 


M+ = M z + Ilres 


= M z + Tlzz(q 2 ) - T\zz{M 2 z ) - {q 2 - Mf) \ q2=M2 , 


( 8 . 22 ) 

(8.23) 


(8.24) 

(8.25) 


where [1 — II 77 (0)] has been used. Let us note that generally when the vac¬ 

uum polarization functions II(g 2 ) are Taylor expanded in the powers of q 2 , only 11(0) 
and the first derivative ^\ q 2 =0 are possibly UV-divergent, since they correspond 
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to the radiative corrections to gauge boson mass-squared and kinetic terms. This 
statement is true for the Taylor expansion around arbitrary point, such as q 2 = M|. 
Thus n ' yy {q 2 ) - n; 7 (0) and n reS = n zz (q 2 ) - U ZZ (M |) - (q 2 - M%)4^\ q 2 =M 2 , 
e.g., are finite, since they contain only the terms with 2nd or higher derivatives, and 
we have got formulae in which the finiteness of physical quantities, like e* and M* 
are manifest . The finiteness of Z* is less trivial (See Problem 8.4). The remaining 
formula for s* contains bare quantity s 2 and is not manifestly finite. So we attempt 
to rewrite s 2 by a quantity made from well-measured observables and e* at the Z 
pole, sva26w\z = ^ rom relations 


47 ra = e 2 (l + II 77 (0)) 

(8.26) 

Ml = ^ + n zz (M 2 z ) 

(8.27) 

Gf 1 n^iy(0) 

y/2 ~ 2v 2 c 2 M z h 

(8.28) 


We get a formula for 6w\ z 


sm 2 6\v\ z 


2c 2 s 2 

s 2 + S (s 2 ) = s 2 + 2 cs56w = s 2 + ^ 5(ln(sin2f?w)) 


2 c 2 s 2 6a SGf 6M 2 z 
S + c 2 -s 2 ( ct G f M§ 

2 , cV /TT) , iw(o) n ZZ (M 2 Z ) S 

C 2 - S 2 ^ 7 + c 2 M 2 M 2 > 


(8.29) 


Thus substituting s 2 obtained from the above relation, we finally get 


s 2 = sin 2 <?vv|z - 


c 2 s 2 
c 2 — s 2 


, Rww{0) 
{ 77 c 2 M 2 


n ZZ {M 2 Z ) 
M 2 Z 


)-cs n' Zl (q 2 ). 


(8.30) 


The part described by IPs turns out to be finite from a similar reasoning to the 
case of Z* and from the property that Hww (0) — c 2 U zz (0) is finite, as it should 
be. All of these finite radiative corrections are the genuine predictions of the theory 
we consider and can be precisely tested by various experiments. For instance the 
corrected s 2 may be tested by studying parity violating asymmetries. The asym¬ 
metry of the scattering cross-sections of polarized electrons at the Z boson pole is 
an example, which is very sensitive to the precise value of the Weinberg angle, since 
the vector coupling of the Z boson to the charged lepton like electron is close to 
zero, — \ + sin 2 6w — 0: 


A = °~( e L e+ -> Z ) - g ( e fl e+ z ) 

LR tr(e£e+ —» Z) + <r(e^e+ — > Z) 


8 (| - sin 2 e w ) 

1 + (1 — 4sin 2 6w) 2 


8 (- - sin 2 8w)- 


(8.31) 

The vacuum polarization functions II 77 , II z 7 , H zz can be written, factoring 
the gauge coupling constants out and indicating the indices of SU(2)l and U(l) em 


P'u.Hc. 7/fftfAeHta^rcflf 
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adjoint representations by 1,2,3, and Q, as 

n 77 = e 2 IIqq, (8.32) 

n Z y = ~ (n 3 Q -s 2 n QQ ), (8.33) 

lizz = ~ (n 33 - 2 s 2 n 3Q + s 4 n QQ ), (8.34) 

e 2 

n ww - n n , (8.35) 


where II22 = Iln due to the unbroken U (l) e m symmetry, and it may also be written 
as n u+ n 22 as we ll. We can understand the form of U Zl , II zz easily if we recall 
that neutral current coupled with Z takes a form fj^hL — s 2 Q)f. 

As an example we calculate the contributions to these polarization functions 
from a pair of quarks (£,&)*, which may be regarded as the quarks of another gen¬ 
eration including 4-th generation as well, if it ever exists. All vacuum polarization 
functions are reduced to just 2 independent functions which correspond to the vac¬ 
uum polarization amplitudes induced by two electro-weak currents of quarks with 
the same and opposite chiralities, respectively: 

II L L(m\,ml,q 2 ) =11 RR (m\,ml,q 2 ) 

-- is? £ dt " - 1,(1 -- \ u <8 - 36 > 


II LR{rn\,m\,q 2 ) = U RL {m\,m\,q 2 ) 

= w i dt ' n[ M^T ( <8 ' 37) 

where M 2 = tm\ + (1 — t)m 2 . Writing polarization function induced by vector 
currents as 

Uvv = n l+r,l+r = n ll + n RR + n RR + n R l = 2(n RL + n^), (8.38) 

the polarization functions of our interest are now written in terms of these two 
functions as 

n QQ(9 2 ) = Qt^vv{m^,m 2 t ,q 2 )+ Q 2 b U vv {m 2 b ,rn 2 b ,q 2 ), (8.39) 

n 3 Q(g 2 ) = ^QtTiLv{m 2 t ,m 2 t ,q 2 ) - ^Q b U LV (m 2 b ,m 2 b ,q 2 ), 

= \[Qt^-w{m 2 t ,m 2 t ,q 2 ) - Q b U V v{m 2 b ,m 2 b ,q 2 )], (8.40) 

n 33 (? 2 ) = ^{U LL (m 2 t ,m 2 t ,q 2 )+n L L(m 2 b ,ml,q 2 )}, (8.41) 

IIii(g 2 ) = LL {m 2 t ,m 2 b ,q 2 ). (8.42) 
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It is worth noting that the “chiral amplitude” has non-decoupling effect for m t 2> 
m b , 

6 A 2 

n Lh{m\,m 2 t ,q 2 ) ~ m 2 t In (-^), (8.43) 

n LL (m 2 t ,m 2 b ,q 2 ) ~ m 2 (In j) + \), (8-44) 

which contribute to A p as 

IMO) -n 33 ( 0 ) ~ ~H LL (m 2 ,mlO) - ±Il LL (m 2 ,m 2 t ,0) ~ ^ m 2 . (8.45) 

On the other hand, when IIll and Ulr are summed up to get Ilyy, relevant for 
the vector-like theories such as QED or QCD, the quadratic term m 2 are cancelled 
out and II vv is proportional to q 2 : 

24 f 1 A 2 

rivvK.m^Wf-^ l _,),. )<«}■ (8.46) 

This proportionality to q 2 is the result of CVC valid in the vector-like theories, and 
is an indication that in such theories the decoupling theorem holds. 

The star quantities are functions of q 2 and each of them has an infinite number 
of observables as the functions of heavy particle masses appearing as the coeffi¬ 
cients of the Taylor expansion in terms of q 2 . We, however, can argue that pos¬ 
sible non-decoupling radiative corrections, oblique corrections, of heavy particles 
are concentrated only in three parameters, called S , T, and [/-parameters (Peskin 
and Takeuchi, 1990), which therefore make the analysis of New Physics simple and 
transparent. 

To see this, suppose we have a vacuum polarization function II ij(q 2 ) where the 
indices (i, j) take either SU (2) adjoint indices 1,2,3 or Q corresponding to unbroken 
U(l)em- At least as far as we retain in the 1-loop level, vertices of the diagrams 
of vacuum polarizations are all proportional to dimensionless gauge couplings (at 
higher loop levels we may have mass dependent couplings due to Higgs exchanges), 
and we may rely on a naive dimensional analysis. Namely when we Taylor expand 
it as the power series in q 2 or equivalently in (M: a generic mass of intermediate 
heavy particles), 

Uijiq 2 ) = Uij (0) + ^ =0 q 2 + ■ ■ •, (8.47) 

we readily know that the order of magnitudes of njj(0) and ^¥-|,2 =0 are 0(M 2 ) 

/ 2\2 

and 0 (q 2 ), respectively, and the residual terms are at most of ), which give 

only decoupling effects. Thus as far as our focus is on the possible non-decoupling 
effects we may treat only the first two terms of the expansion. Thus, at the first 
sight, there seem to exist 8 (= 2 x 4) parameters, corresponding to the 4 choices 
of the gauge indices ( i,j): (1,1) = (2,2), (3,3), (3 ,Q), (Q,Q), or equivalently the 
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choices of gauge bosons in the external lines of the vacuum polarization diagrams 
(W + , W~), (Z, Z), ( 7 , 7 ), (Z, 7 ). Actually, not all of these parameters exist. In 
fact, Hqq(0) = n 3 o(0) = 0 as the result of CVC of U(l) em : we never have mass 
renormalization to the photon as we wish the photon to travel at the speed of light. 
We thus have 8 — 2 = 6 parameters remain. We also should aware of the fact that 
some of the remaining 6 parameters should be understood as to be used for the 
renormaliation of three bare quantities, g,g',v describing the gauge sector. This 
is because the first two terms of the Taylor expansion correspond to the radiative 
corrections to the operators with mass dimensions d = 2 and d = 4 (in “broken 
phase” of the theory). Thus among 6 parameters, three parameters are used for 
renormalization, i.e. in the process to fix the bare parameters in terms of a, Gf, Mz 
etc.. In other words, these are inputs rather than the prediction of the theory. In 
this way, the outputs of the theory, i.e. the genuine predictions of the theory, are in 
the remaining 6 — 3 = 3 parameters, which are nothing but the S, T, [/-parameters 
we are interested in. 

As is clear from the above argument, such three parameters S,T,U should be 
automatically finite quantities, as they correspond to operators, which do not exist 
in the original Lagrangian: they need not to be renormalized. Actually, such finite 
combinations of poralization functions have already appeared in the discussion of 
star-quantities, i.e. in the attempt to show the finiteness of the star-quantities. 
Namely the parameters are defined as follows: 


aS = 4e 2 [IT 33 (0) - II 3 Q (0)], 

(8.48) 

aT = ^| [nil(0) - n33(0)] ' 

(8.49) 

aU = 4e 2 [n' u (o)-n' 3 (o)], 

(8.50) 


where we have used a notation IIL(O) to denote ^^-\ q 2 =0 , which are nearly equal 
to II ij(q 2 )/q 2 for the case of ILj(O) = 0, which were originally denoted as II'^ , once 
we ignore We can confirm that S, T, U really do not suffer from the 

UV-divergences. The finiteness of S-parameter is guaranteed by the fact that there 
is no mixing term of SU(2)l and U(l)y field strengths, F^ v in the original La¬ 
grangian. T and U should not get UV-divergence since whose existence contradicts 
with the “custodial symmetry” of the terms F^ u and a -^~ X)i=i 

in the original Lagrangian: no couterterm can be prepared for the quantities which 
violate the symmetry of the original Lagrangian. 

We can easily check that, in the approximation that ) is negligible, the 

star quantities are now describable in terms of S,T, [/-parameters: 

~ 0, (8.51) 

= JTZ15 n (8-52) 


l_ _]_ 

el 47TQ 

s 2 - sm 2 6 w \z 


~Pufuc. 7/faf/taMtafina/ P/t^.Atc.1 



208 


PRECISION TEST OF ELECTROWEAK RADIATIVE CORRECTIONS ... 


1 — 4 C 2 S 2 (8.53) 

Ml-M 2 z ~ 0, (8.54) 

where e* and M* do not obtain quantum effects in this approximation, i.e. for 
q 2 M 2 , since these quantum effects stem from 2nd or higher derivatives of 11, j. 
In addition, we can list up a related star-quantity p*(0) which is defined as the ratio 
of the 4-Fermi coupling of low energy (q 2 = 0) neutral current process to that of 
charged current process: 

47/ C) = P*mH[hL - slQ}f)U'lAl'zL - slQ']f). (8.55) 

After some arithmetic we find 


P*(0) -l~aT. 


(8.56) 


Thus we know that the T-parameter (times a) is equivalent to A p we had been 
discussing. 

As the example we consider the contribution of a pair of quarks ( t , b) (or equiv¬ 
alently that of ( t',b '), the hypothetical quarks of 4th generation) to the 5, T pa¬ 
rameters. By use of (8.48), (8.49), (8.39)-( 8.42) and (8.36)-(8.38), we easily find 
their contributions to be: 


5 

T 


[1 - l In—} 
2n L 3 rrib 


1 1 


16n c 2 s 2 Ml 


[ m t + m b 


m 2 - m 2 



(8.57) 

(8.58) 


where the formula for T is just A p/a, as we expected. We learn that in the limit 
of global SU(2) “custodial” (or weak isospin) symmetry, i.e. m t = mj, T vanishes, 
just as A p does, but (the first term of) S remains: S = A-. 

This result is relevant especially for the technifermion contribution, since in 
technicolor theory (Susskind, 1979; Weinberg, 1979), just as in QCD, the tech¬ 
nifermion condensation, corresponding to the VEV of Higgs, preserves the custo¬ 
dial symmetry: < TyTy > — < TqTd >, which means that the resultant dynamical 
masses due to technicolor interaction are degenerated, and each technifermion dou¬ 
blet contributes to 5-parameter additively. Thus the precision test on 5 parameter 
is expected to yield a meaningful constraint on the theory. To be precise the above 
calculation cannot be directly applied to the contributions of bound states such 
as techni-hadrons, since technifermions are treated as free fermions in the above 
analysis. Thus more sophisticated approach is desirable, though the above results 
are enough for rough estimation. A sophisticated analysis, relying on the dispersion 
relations of vacuum polarization functions of vector- and axial-vector types, whose 
imaginary parts are measurable R- ratios, provides 


5 = 0.4 (for 1 technifermion doublet) 
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= 2.1 (for 1 “generation” techinicolor model), (8.59) 

assuming that the number of the technicolor Ntc — 4. Comparing with the upper 
bound on S obtained from the precision measurements at LEP etc., Peskin and 
Takeuchi were able to rule out the 1 “generation” technicolor model (Peskin and 
Takeuchi, 1990). 

The constraints on S, T, 17-parameters imposed by the precision experiments at 
LEP etc. are not only used to get information on the new physics, but also quite 
useful to constrain the masses of heavy particles in the standard model, i.e. nit and 
mH ■ Especially, since the T-parameter (A p) is quite sensitive to , it was possible 
to predict rather precisely the value of mt as somewhere around 175(GeV), before 
the direct discovery of the t quark at Fermi Lab. 


8.4 Global Symmetries 


We have seen that all of S, T, U parameters are all “calculable” quantities, free from 
UV-divergence without any need of renormalization. /.From a bit different point of 
view this is the consequences of global symmetries, which remain in the gauge sector 
of the theory even after the SSB. To see this, let us take a matrix form of the Higgs 
fields and introduce a global SU(2)r symmetry. 

We start by writing the Yukawa coupling in the following form with the matrix 
form $ of the Higgs field (ignoring its VEV for a while), 


Ty ukawa 


fu ( UL, d L ) j u R + f d ( UL, d L )[ Jo* j dR + h.c. 

(u L , * ) *('■ l)(Z) +h - C - (8 ' 60) 


where 


$ = 


<P 




,o* 


H+iG 3 
V2 
iG~ 


-iG+ 

H-iG 3 

V2 


= -= [HI + iG* a i 

V2 


(8.61) 


which just corresponds to the linear-u model discussed in QCD to describe the low 
energy effective theory, where H and G 1 are replaced by a scalar and pseudo-scalar 
bound states of quarks, a and it 1 (i = 1,2,3). The technicolor theory takes this 
analogy seriously and regard the Higgs fields as the bound states of hypothetical 
fermions with quantum numbers “technicolors”, instead of ordinary colors, called 
technifermions T. In particular H ~ T v Tu + T D T D (for 1 doublet model of techni¬ 
color), though the Arc , the mass scale where the technicolor interaction becomes 
strong, is scaled up to 0(lTeV). 

If the degeneracy m u = m d is realized, the theory is invariant under a global 
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chiral symmetry SU (2)/ x SU (2) R : 



$ -+ = 9L $ 


( 9 l, 9 r- elements of SU(2)£ and SU(2) fl ) (8.62) 

Let us note the squared absolute-value of the Higgs doublet is given in this matrix 
form as Tr(4>t $) = H 2 +G t2 , which is clearly invariant under the 517(2)/, xSU(2)r: 
$') = iv($t $) ■ H > 2 + G >i2 = H 2 + G i2 This means the real fields 

H, G l behave as the fundamental representation of 50(4), which is equivalent to 
50(2) x 50(2), as is well-known in the group theory. Just as in QCD, after Higgs 
develops its VEV v ( H — > v + H), corresponding to non-vanishing < uu >=< dd > 
in QCD, the chiral symmetry is spontaneously broken, but leaving its vector-like 
subgroup SU(2)v generated by 9l — 9r = 9 unbroken: 



This remaining vector-like symmetry is called “custodial symmetry” (Sikivie et al., 
1980), or we may just call it as (global) weak isospin symmetry. Let us note that 
such custodial symmetry is valid only when the degeneracy of the quark masses 
m u = md is realized, since 

9 ' (o t) S *{ 0 /„)' (8 ' 64) 

unless f u = fd- This is intuitively trivial as m u ^ rrid should break the weak isospin 
symmetry. 

The SU (2 )l gauge bosons (i = 1,2,3) and U(l)y gauge boson transform 
under the chiral symmetry SU (2) i x SU(2)r and the custodial symmetry as 

(Al,Al,Al): (3,1) and 3 (8.65) 

5": (1,3)+ (1,1) and 3+1, (8.66) 

where the representation of the U(1)y gauge boson breaks into the two pieces 
(1,3) and (1,1), since the weak-hypercharge may be written as y = Ur + --A,- 1 -'-, 
where I 3 R is the eigenvalue of the 3rd component of “right-handed weak-isospin” 
SU(2)r and B, L denote baryon and lepton numbers respectively. (Or we may 
rewrite the Nakano-Nishijima-Gell-Mann relation as Q = I 31 + I 3 R + in 
a left-right symmetric form, which really holds in the left-right symmetric gauge 
theory 517(2)/, x SU(2)r x U{1)b-l, though in the gauge group of the standard 
model, we are taking now, the SU(2)r is not gauged.) 

Paying attention to the fact that Y denotes the mixing of 

517(2)/, and U{\)y gauge bosons in the kinetic term, and that Hu — ^3 = ^pln + 
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II22] - II33 we realize the representation of the oblique parameters under the chiral 
and custodial symmetries: 

5: (3,3)+ (3,1) 1 + 3 + 5 (8.67) 

T, U : (5,1) 5. ( 8 . 68 ) 

where in the representation of 5, 1 + 5 comes from (3.3) while 3 comes from (3.1). 
Let us recall the simple algebra in SU (2) group theory, 3x3 = 1 + 3 + 5, and 
note that only 1 and 5 are relevant for us, since vacuum polarization functions 
should be symmetric under the exchange of gauge indices. The linear combination 
|[IIix + II 22 ] — II 33 can be expressed by a traceless 3x3 matrix and should belong 
to 5 representation of 51/(2), while traceful combination IIn + II 22 + II 33 behave 
as the singlet. It is now easy to understand that T, [/-parameters should vanish in 
the limit of custodial symmetry: T, U behave as a non-singlet of 51/(2)y, namely 
as 5, and should vanish in the limit of exact SU(2)v symmetry, since in the all 
orders of quantum corrections, quantities contradicting with the symmetry never 
appear. In such sense, the symmetry handling the T, U or A p is the custodial 
symmetry. Strictly speaking, it may be more suitable to say that actually what 
handle these parameters is the global 51/(2)/, rather than the custodial symmetry 
(Inami-Lim-Yamada, 1992) simply because these parameters are concerned only 
with SU(2)l ■ (In fact to get 5 representation we need quartic term of quark masses, 
or equivalently an operator quartic in the Higgs field, as we will see in the following 
subsection, while the quark mass matrix M q = ^diag (f u , fd) has a piece of 3 
under SU(2)v and quadratic term of quark mass is enough to get 5, if we invoke 
to the custodial symmetry. This becomes clear in the operator analysis extended 
in the next section.) Then, what symmetry handles the 5-parameter ? This time 
obviously custodial symmetry is not the one, since the part (3,3) of 5 contains a 
singlet 1 under the SU(2 )y, the constant term ^ in (8.57), which survives even 
in the limit of degenerate doublet {nit = %)• We may say that the relevant 
symmetry for the parameter 5 is just the chiral symmetry 517(2)/, x SU(2)r, as 
the 5-parameter behaves as (3,3), therefore not as an invariant under the symmetry 
(Inami, Lim and Yamada, 1992). In this way we can readily understand why each of 
the technifermion doublet gives additive contribution to 5: each of the condensation 
< TuTu >=< TdTd > break the chiral symmetry spontaneously, thus contributing 
to 5. 


8.5 Operator Analysis 

We have seen T, for instance, is calculable finite quantity, although the gauge bo¬ 
son mass-squared seems to be the coefficient of relevant operators with d = 2 , 
W + W~ etc. As was discussed in section 8.1, in the renormalizable theory a calcu¬ 
lable quantity should be a coefficient of some irrelevant operator. It is known that 
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the T-parameter is described by the coefficient of the following irrelevant operator 
(Problem 8.6) 

0 T = (</>+ D, <f>)( 4 >* D» 4>)~ \ D „£>** 4>){ct>U)- (8.69) 

Similarly, 5 parameter is proportional to the coefficient of another d = 6 operator, 

Os = W {Fly) <j>\ B^. (8.70) 

It is easy to see that when the replacement (f> -A (0, ^) 4 is made this operator 
causes the mixing between the field strengths of SU(2)l and U(l)y, i.e. F 2 V and 
B 

It is now obvious why the S,T, [/-parameters are all calculable finite quantities: 
they are all described by the coefficients of higher dimensional d > 4 irrelevant 
operators, such as those listed above, which do not exist in the original Lagrangian. 
Thus the coefficients should be all finite as long as the theory is renormalizable. Now 
the difference of decoupling and non-decoupling effects of heavy particles is also easy 
to understand. The examples we discussed above tell us that when heavy particles’ 
contributions to physical observables are decoupling ones, there is a characteristic 
feature that heavy particles get their masses through some new mass scales, such as 
Msusy, which are inevitably gauge invariant. In that case, the coefficients of the 
irrelevant operators induced by a heavy particle will be suppressed by the inverse 
powers of M, with M denoting a generic new large mass scale. For instance the 
effective Lagrangian for the 5-parameter behave as 

~ jp W 0 Vly)4>] (8.71) 

as is clear from dimensional analysis. This means the heavy particle contribution 
is suppressed by the factor 1/M 2 . On the other hand, we have learned that when 
heavy particles’ contributions are non-decoupling ones, heavy particles get their 
masses through “strong” couplings with Higgs, the origin of the masses being the 
VEV of the Higgs (SSB), without demanding any new large mass scales. In this 
case M\y is a unique mass scale and the coefficients of irrelevant operators are not 
suppressed, and even enhanced by the “strong” couplings in some suitably chosen 
cases. This is why such contributions are non-decoupling. 

One defect of the operator analysis extended above is that we may put arbitrary 
powers of a gauge invariant factor <j)^<f> to a given irrelevant operator, all of them 
giving the same operator in the broken phase. When we are considering some 
decoupling effects, it will not be a real problem, since the insertion of </>f cj> will make 
the coefficients of the operators more suppressed by higher inverse powers of M, 
such as 1/M 4 ,1/M 6 , which are less important. In the case of non-decoupling effects, 
however, such further suppression is not the case, as (^<p)/v 2 -A 0(1) after the 
replacement of the Higgs by its VEV. This problem is evaded once we move to the 
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non-linear realization of the Higgs field, as discussed in section 8.2: 

l/Ee^, (8.72) 

where the physical Higgs field H has been ignored, since it does not appear in the 
processes we are interested in (at 1-loop level). In this non-linear representation we 
do not have the problem of getting infinite number of operators for one observable, 
since WU = I is a c-number. Analysis based on the non-linear realization shows 
that potential non-decoupling effects in gauge boson 2 and 3 point functions are 
given by 7 independent operators. The reason why we can treat both of the 2 and 
3 point functions on an equal footing is that in non-Abelian gauge theory they are 
mutually related. For instance, Os not only has gauge boson 2 -point function, but 
also 3-point function as well, since has both linear and quadratic terms of gauge 
bosons. 


Problems 

8.1 Calculate the diagrams in Fig.8.3 to get the formula for A p in (8.12). Also 
show that ( 8 . 12 ) vanishes for rn t — mb. 

8.2 Show that (8.17) is approximated by (8.18) when m? u d Mg USY (m| — 
m u + MsvsY> m \ — m d + M^usy)- 

8.3 Verify the relation in (8.21) at the 1-loop level. 

8.4 Prove that Z+ defined in (8.25) is finite. 

8.5 Verify the formula (8.57) by use of (8.40), (8.41) and (8.48). 

8.6 Show that the T-parameter is given as the coefficient of the operator Or in 
(8.69). 
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Chapter 9 

FLAVOR PHYSICS AND CP 
VIOLATION 


9.1 The Interest in Flavor Physics and CP Violation 

The gauge sector of the standard model (SM) is theoretically uniquely determined 
by gauge principle. The Higgs sector, on the other hand, has many arbitrary pa¬ 
rameters, whose values cannot be theoretically predicted. Especially so far we have 
no definite idea about the origin of quark and lepton masses and generation or fla¬ 
vor mixings, i.e. the origin of the Yukawa couplings of Higgs field. Issues related 
with quark and lepton flavors, “flavor physics”, is therefore very important clue, not 
only to the the confirmation of the standard model, but also to the search for some 
theories beyond the standard model, “New Physics”, where the origin of flavors is 
expected to be understood more deeply. 

Among such flavor physics, Flavor Changing Neutral Current (FCNC) processes 
are of special interest. They do not exist at tree or classical level in the standard 
model, and are induced only at loop or quantum level. Therefore the observation of 
such FCNC processes will provide us valuable information on the contributions of 
all particles as the intermediate states. They are, therefore, very suitable, not only 
to get information about the flavor mixings of relatively heavy t quark with lower 
generations, but also to search for possible heavy particle effects of New Physics. 
Namely the studies into the FCNC processes is important for the progress of particle 
physics. The rates of such FCNC processes are known to be suppressed by both of 
the higher order of the perturbation theory and the small mass difference or small 
flavor mixings. Thus, the FCNC processes are often called rare processes. 

In fact, historically the rare processes have played very important role in the 
foundation of the particle physics. We may list up some of the most important 
events, some of them have already been discussed in 4.6.2: the introduction of 
c quark, in order to naturally suppress the rare processes of neutral kaon such as 
K° ■(-> K° mixing, by GIM (Glashow, Iliopoulos and Maiani, 1970), the prediction of 
the mass of such predicted c quark by Gaillard and Lee (Gaillard and Lee, 1974), the 
introduction of the third generation to implement the observed CP violation in the 
neutral kaon system by Kobayashi and Maskawa (Kobayashi and Maskawa, 1973), 

215 





216 


FLAVOR PHYSICS AND CP VIOLATION 


which embodies the standard model nowadays, the lower bound on mt imposed by 
the data on B° <4 B° mixings, obtained before the direct discovery of t quark, etc. 

The CP-violating processes are also known to happen rather rarely in neutral 
kaon system. In such sense, the physics of CP violation also provides valuable 
information of the heavy t quark and/or heavy unknown particles of New Physics. 
We should further note that the Kobayashi-Maskawa (KM) model of 3 generations 
was originally devised as the theory to accommodate CP violation. In spite of the 
remarkable success of the KM model, such as N„ = 3 (N„: the number of light 
neutrinos) at LEP, yet there has been no final conclusive argument on the origin 
of CP violation. So, the confirmation of the prediction on CP violating observables 
of the model, such as CP asymmetries in B decays, is an urgent necessity to really 
establish the standard model. We may even find some evidence of some New Physics 
in the course of the investigation. 

The FCNC processes and CP violation is, in principle, mutually independent. In 
fact, in the model of spontaneous (or soft) CP violation (Lee, 1974; Weinberg, 1976), 
the CP violation is attributed to the complex phase of the vacuum expectation value 
(VEV) of the Higgs created by the suitably chosen scalar potential, and has no direct 
relation with the flavor mixing. On the other hand, in the mechanism of KM, both 
of quark mass matrix causing the flavor mixing and CP violation come from the 
same Yukawa couplings. Therefore they are mutually very closely related. In fact 
in KM model we will see below that the CP violation needs mass differences and 
flavor mixing, (of course) in addition to the complex phase in the KM matrix. CP 
violation, therefore, is observed in FCNC processes, such as K° <4 K° and B° <4 B° 
mixings. 

The purpose of this chapter is to discuss the rare processes due to FCNC and 
the CP violation. We assume the theory we work on is the standard model, since to 
provide the results in SM is quite useful even if we further investigate New Physics. 
Furthermore, some of the formulae we derive are readily applicable for the class 
of New Physics, where the Yukawa coupling of the Higgs has the same structure 
as that of the standard model, e.g. the model with four or more generations, 
minimal supersymmetric stadard model. In this chapter we will discuss FCNC rare 
processes only in the quark sector. Once neutrinos become massive, the lepton flavor 
violation, i.e. FCNC processes in lepton sector, also becomes physically meaningful. 
We, however, will not deal with the lepton flavor violation here, since the neutrino 
oscillation as the typical example of lepton flavor violation has been extensively 
discussed in chapter 6. 


9.2 Flavor Symmetry and FCNC Rare Processes 

As was discussed in 4.6.2, GIM escaped FCNC processes at tree level, by introduc¬ 
ing c quark. What GIM has proposed is that by introducing c quark, the weak 
isospin of s quark became the same as that of d quark and even after the uni- 
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tary transformation to the mass eigenstates the neutral current is kept flavor- or 
generation-diagonal. We may say that the essence of their idea is to make the 
gauge interaction of each sector of fermions with definite electric charge and chiral¬ 
ity universal, i.e. to make it invariant under global SU(n g ) transformation, with n g 
being the number of the generations ( n g = 3 in KM model). If we call the gauge 
interaction to connect the elements of doublet “vertical symmetry”, this flavor (or 
generation) symmetry may be called as “horizontal symmetry”. By imposing such 
global flavor symmetry, the FCNC, such as s 7 M d coupled with Z boson or photon are 
strictly forbidden at the tree level. We call such flavor conservation in the neutral 
currents as “natural flavor conservation”. The concept of natural is used to denote 
some property which is naturally guaranteed by some symmetry of the theory irre¬ 
spectively of the tuning of the parameters of the theory. From such point of view, 
for instance, small Higgs mass in the standard model is not natural (the hierarchy 
problem), but it can be naturally small in the SUSY standard model, etc. Glashow 
and Weinberg have summarized the conditions for the natural flavor conservation 
in neutral currents coupled with gauge or Higgs bosons (Glashow-Weinberg, 1977): 

1. Fermions with the same electric charge and chirality should belong to the same 
representation of the gauge group. 

2. Fermions with definite electric charge should couple with only one Higgs doublet. 

The first condition is the condition for the neutral current gauge interactions do 

not have FCNC. If the condition is met, a neutral current jj?' 1 generally has a form 
in the base of weak-eigenstates 

(uiL,U 2 L, ■ ■ ■ ,Wr,U 2R, ■ ■ - ,diL,d2L, ■ ■ -,diR,d 2 R, ...) 

x diag(a, a,..., b, b, ..., c, c,..., d, d, ...) 

x {uil, u 2 l, ... ,uir,u 2 r, ... ,dn, d 2 L, ■■■, diR, d 2 R ,.. .) l , (9.1) 

and this form is invariant under the unitary transformations to the mass eigenstates, 
since the transformation is among the fields with the same electric charge and chi¬ 
rality, such as uil, u 2 l, ■ • ■> and therefore the unitary matrix for the transformation 
takes the form of block-diagonal: 

C4 di ag(a, a, ...)U L = diag(a,a,...), (9.2) 

etc. The second condition is necessary for the Yukawa coupling to naturally con¬ 
serve flavors. If, for instance, up-type quarks are allowed to couple with two Higgs 
doublets (pi,<t> 2 , their neutral Higgs fields generally have the following Yukawa cou¬ 
plings, 

Uzl[fij )l fi 0 i + fij ] V 2 \ u jR + h.c.. (9.3) 

One linear combination of the Higgs fields, , * 2 [nt <fi + v 2 (< '-p® 2 >= 

V v l+ t; 2 ’ 

has a flavor-diagonal Yukawa coupling with the mass eigenstates u,c,... in 
accordance with the diagonalization of the mass matrix for up-type quarks, while 


Pitfu* 7/faf/te.maf /clclL 'iA 



218 


FLAVOR PHYSICS AND CP VIOLATION 



Fig. 9.1 


the Yukawa coupling of the orthogonal linear combination 


y/vf+V[ 


= [v 2 $ - v i <p$] 


causes the FCNC at the tree level. Let us note that in SUSY standard model, 
because of the chirality (holomorphic property) of the super-potential, even though 
we have two Higgs doublets, there exists a selection rule to forbid the simultaneous 
Yukawa couplings of the two doublet Higgs. 

Since the standard model satisfies the Glashow-Weinberg’s condition, the FCNC 
is forbidden at the tree level. The flavor symmetry SU(n g ), however, is broken by 
the quark mass differences in each of up- and down-type quark sectors, i.e. by 
m u ^ m c ^ m t etc. We may still have symmetries of each generation number, i.e. 
a sub-symmetry U( l)" 9 , if there is no generation mixings, but we know that there 
exist such mixings and the flavor symmetry is thus completely broken. We thus 
expect that at the quantum or loop level of Feynman diagram the FCNC processes 
are induced by charged current interactions, which have generation mixings via 
KM matrix U. In Fig. 9.1, we have a diagram with VF + -exchange, which induces 
effective dsZ and ds'y vertices. 

The amplitude of Fig. 9.1 is expected to vanish for an exact SU (3) symmetry, 
m u = m c — mt- In fact, we easily find this is the case, since in this case the 
orthogonality of the KM-matrix U* d Ui s — 0 works. Thus, to get FCNC we 
need the violation of flavor symmetry due to both of non-degenerate quark masses 
at up- (and down-) type quark sector, and flavor mixings. More intuitively we 
may say that when the degeneracy m u = m c = mt holds there is no mean to 
distinguish the ( u,c,t ) quarks. Thus we may treat ( u',c',t') obtained by arbitrary 
unitary transformation of (u, c, t) as mass eigenstates with an equal right. Thus it 
is always possible to choose the unitary transformation of the left-handed up-type 
quarks, so that it makes the KM-matrix unit matrix, thus making the generation 

mixing meaningless. Suppose we have only two generations with a mixing angle 9 C . 

2 2 ~ 

Then the above amplitude is proportional to U* d U cs = cos# c sin#, 


2 ^ 
aTH7 


6 x 10 -5 . We learn from this simple exercise that the amplitudes of FCNC processes 
are suppressed by small mass-squared differences (the GIM mechanism). If we 
include the third generation the relevant mass differences mf — rn^ c are not small 


~Pu.f lc /tc.»t(if ta/ 



Rare Processes in Kaon System 


219 


compared with M^,. But, on the other hand, the mixings of t with 1st or 2nd 
generations are small compared with sin# c , thus making the amplitude rather small. 
Thus, the rates of FCNC processes are tiny and these processes are often called “rare 
processes”. It is worth noticing that the FCNC processes in quark sector and the 
neutrino oscillation in lepton sector are both induced by the mass differences of 
different generations and generation mixing. This is not surprising, as the neutrino 
oscillation is one of FCNC processes. 


9.3 Rare Processes in Kaon System 

As we have seen in the introductory argument in section 9.1, the rare processes in 
neutral kaon system of (K°, K°) have played very important roles in the establish¬ 
ment of the standard model. (K°,K°) are the bound states of s and d quarks and 
their anti-quarks, K° ~ § 75 d, K° ~ J 75 S. There is a freedom of relative phase 
of these two states, but here we adopt a convention CP \K°) = — | K°). We may 
define a quantum number (a 17(1) charge) “strangeness” S carried only by s and s 
quarks, S = -1 for s, S' = 1 for s, S = 0 otherwise. The neutral kaons, K°, K°, 
thus have 5 = -1, 1, respectively. In the FCNC processes of kaons, the strangeness 
changes, and we may classify the rare processes by the extent of the change of the 
strangeness: 

(a) |AS| = 2 : K° <-> K°, 

(b) |AS| = 1 : K l -> /z/i, K+ ->■ tt+i/P, 

where in the decay K + —> 7 the contributions of neutrinos of all generations 
should be summed up, as long as they are light. We discuss these two types of 
FCNC processes separately below. 

(a) |A5| = 2 process 

The CPT theorem tells us that the masses and the life time (total decay width) 
of particle and its anti-particle should be exactly the same. Thus if only strong 
interaction QCD is switched on, K° and K°, being anti-particles of one another, are 
two degenerate states without any interaction (as the strong interaction preserves 
the strangeness S ). The situation is somehow similar to the case of double-well 
potential with very high wall separating two degenerate vacuum states | 1 ), | 2 ) in 
the quantum mechanics (see Fig. 9.2). 

In the case of double-well potential, as the height of the barrier wall gets reduced 
the tunneling effects connecting two states |1), |2) starts to operative. Then the 
Hamiltonian in the base of these states takes a form 


/ Eo Et\ 
\Et Eq ) 


(9.4) 




H = 
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V(x) 



Even if the tunneling effect E t is small, the energy eigenstates get large modification, 

^[|1>-|2>], ^[|!) + |2)], (9.5) 

with energy eigenvalues E 0 - E t and E 0 + E t . 

Similarly, when weak interaction is switched on to the neutral kaon system, 
the strangeness is no longer preserved and FCNC process K° K° arises. The 
Hamiltonian in the base of (K°, K°) at the rest frame now reads as 


f M \ 
\Mh M)' 


(9.6) 


where M is the common mass implied by the CPT theorem and is due to the 
K° *-> K° mixing. For simplicity, here we have ignored absorptive part, denoted by 
T, coming from the on-shell intermediate states, which correspond to the fact that 
the neutral kaons are unstable states decaying into nn etc. Thus the Hamiltonian is 
hermitian, though we will treat non-hermitian Hamiltonian including the absorptive 
part later in this section. We further simplify by ignoring a small CP violating 
effect in the element M \2 due to the CP phase of KM matrix for a while, assuming 
Ml 2 — Mi 2 . Then just as in the case of quantum mechanics, we find two mass 
(energy) eigenstates 


\ R i) = ^ [l*°> - \K°)b 


\K 2 ) = -j=[\K°) + \K°)], 


with the mass eigenvalues 


(9.7) 


mi = M — M\2, m 2 = M + M\2 . 


(9.8) 


As we have ignored the possible CP violating effect, these states have definite CP 
eigenvalues: 

CP\Ki) = \Ki), CP \K 2 ) = -\K 2 ). (9.9) 
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Fig. 9.3 

Actually when we include the CP violating effect and the absorptive parts, real 
eigenstates of the Hamiltonian are Ks and Kl states with shorter and longer life 
times, which are nearly equal to K l and K 2 , respectively. Their mass difference 

A tuk = rriK L ~ mx s — 2Mi 2 , (9.10) 

is an observable, and is known to be extremely small: A ttik = (3.552 ± 0.016) x 
10 -12 MeV. 

The elementary process to cause the mixing at the quark level is the 1-loop 
diagram, called “box diagram” (See Fig. 9.3). 

In Fig. 9.3, are the would-be Nambu-Goldstone boson, which has a propaga¬ 
tor in R,£ gauge (discussed in the gauge theories with spontaneous gauge symmetry 
breaking), - tA -. As we see below the effective Lagrangian due to the top 

quark exchange in this box diagram has a term proportional to m 2 , which comes 
from the diagram with ip + and ip~ exchanges (at least for £ ^ 0), which has 4 
Yukawa couplings yielding a factor ( gm t /Mw) A • The factor is then multiplied by 
l/mt 2 (for mt > Mw) coming from dimensional analysis, thus producing a net 
effect (p 4 m 2 )/M^, a non-decoupling effect of heavy t quark. Note that this box 
diagram is only possible quark diagram at 1 -loop level; the diagrams with 1 -loop 
induced FCNC Z- or 7 (photon)-vertices, which are relevant for |A5| = 1 processes, 
such as Kl -t pp, changes the strangeness only by 1 unit and do no contribute to 
the IASI = 2 process K° «-> K°. In the pioneering calculation by Gaillard-Lee, 
an assumption that the masses of intermediate up-type quarks are much smaller 
than Mw was made, which is a good approximation in the 2 generation (4 quark) 
model, m c Mw, which was the standard picture of that time. The calculation, 
however, needs to be revised, since we now know that there are 3 generations of 
quarks and mt is greater than the weak-scale Mw- The calculation which is valid 
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for the arbitrary masses of intermediate up-type quarks was performed (Inami and 
Lim, 1981) and the result can be represented in a form of effective 4-Fermi |AS| = 2 
operator (see also (8.14)): 

4// 1=2 = % - X J^T 0 E ( V uV id )(V; s V jd ) E(xi, Xj ) (sj.LdKs^Ld), 

Y W ij= c ,t 

(9.11) 

where the contributions of two internal quarks iq, Uj ( i,j = u,c,t ) are contained 

2 

in the coefficient function (with X{ = jA~) 


E(xi,xj) 


-XiXj{ 


Xi — xj 4 


1 ( 7 - 


4 (xi - l){xj - 1) 




3 1 

2 Xi - 1 


3 1 

4 (Xi - l) 2 


) In Xi - (xi -A Xj)] 


(9.12) 


and L = V) s etc. are K-M matrix elements. We have replaced the original 
coefficient function E(xi,Xj) by E(xi,xj) = E(xi,xj) — E(x u ,Xj) — E(xi,x u ) + 

/v 2 

E(x u ,x u ), setting x u = = 0. The additional three terms exactly vanish by 

the orthogonality of K-M matrix J2i = 0, J2j k)* Vjd = 0. This is why the 

summation of i,j is only over c and t. When both intermediate quarks coincide we 
get 


E(xi) = E(xi,Xi ) = -|( X '_ 

Z Xi 1 


) In Xi - Xi[- - 


1 9 1 


4 4x{ — 1 2 (xi - l) 2 


(9.13) 


Though the above result is valid for an arbitrary intermediate quark masses, the 
external quark masses ra^, m s and their 4-momenta have been ignored there, which 
is a good approximation. To get M 12 , we need to evaluate the matrix element of 
the 4-Fermi effective operator with respect to kaon states, 

(K°\(s ltl Ld)(s^Ld)\K°) = | flm\B , (9.14) 


where fx, m# are decay constant and the mass of meson (mx — M in (9.6)), and 
the B is the “bag-parameter”, which parameterizes the ambiguity due to the non- 
perturbative QCD effects to form the bound states K° and K°, and the reference 
value B = 1 corresponds to the vacuum saturation, ( K° | (sy tl Ld)\0) (0|(s 7 M Ld) \K°). 
Thus writing the off-diagonal element of the neutral kaon mass-squared matrix as 

6 m 2 ee -(K°\£[ A f f= 2 \K 0 ) 

= [(V*V id )(V; s V jd ) E( Xi ,Xj)], (9.15) 


the mass-squared matrix reads as 


/ M 2 8m 2 \ 
\dm 2 M 2 / ’ 


(9.16) 
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and taking the square-root we obtain the Hamiltonian at the rest frame, with an 
approximation 5m 2 -C M 2 , 


( M 

( 6 m 2 
\ 2 M 


6 m 2 
2 M 


M 2 y ’ 


(9.17) 


which means M 12 ~ We thus obtain 


A mpc — 


,, Re<5m 2 

2ReMi 2 - ——- 
M 

-^f fasin ^e w f * mKB Re ^ V ^s V id)} E( Xi , Xj ) 

-3.8 x 10- 2 m K £ ReK^Hid)^*^)] £(*<,*,-)■ (9-18) 


In the case of restricted 2 generation model, noting E(x c ) ~ —x c for i c C 1, we 


get 


\ otG F r K m K „ (sin 6 C cos 0 C ) 2 m 2 

Amif ~ -7=-5—— ts -- 

6\/27rsin 2 6w M w 


(9.19) 


which reduces to the result obtained by Gaillard and Lee. Comparing this with the 
observed Arrix they succeeded to predict the mass of c quark to be m c ~ 1.3(GeV), 
before the discovery of the c quark. Unfortunately, in the full 3 generation model, 
the generation mixings of t quark with lower generations have not been determined 
precisely, thus making the prediction of mt not straightforward. In addition, though 
the above result is suitable for perturbative contributions of heavier quarks c, t, there 
may exist non-perturbative effects of intermediate u quark, which make our predic¬ 
tion not conclusive. The CP violation observable e discussed later, however, being 
proportional to Im(V)* V {,1 V* s Vjd), does not suffer from such problem, since the inter¬ 
mediate u quark does not contribute, and the above formula is directly applicable. 
Thus by use of e we can impose a useful constraint on the generation mixings and 
CP phase, namely on the unitarity triangle discussed later in this chapter. 


(b) |A,S| = 1 processes 

The typical |AS| = 1 processes are Kl —> Rp- and K + —> -n + vD. ( Kl —► 7 r°e + e _ 
is the signature of CP violation and is another interesting process to study. But 
the derivation of its decay rate is obtained in a similar manner with the argument 
below, and will not be given.) 

Now the diagrams with 1-loop induced FCNC Z- or 7 (photon)-vertices also 
contribute to the processes, and the quark Feynman diagrams, relevant for the 
Kl -> pp and K + —>- tt + vu, are shown in Fig. 9.4, where vi denotes all possible 
light neutrinos (l = e,p,r for the standard model). Note that the 7 -exchange 
diagram obviously does not contribute to the decay K + —> tt + vD. The 1-loop 
induced FCNC Z- or 7 (photon)-vertices, denoted by blobs in Fig. 9.4, are the sum 
of the sub-diagrams shown in Fig. 9.5. 
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Fig. 9.4 


One may wonder why we should include the 1-particle reducible diagrams with 
FCNC self-energies such as Fig. 9.5(a), and wonder if such FCNC self-energy should 
be eliminated by adding a suitable counterterm for the external quarks d, s to have 
physical meaning as the eigenstates of propagation. Such procedure is also pos¬ 
sible, but it turns out that the contributions of all counterterms just cancel with 
each other in the FCNC processes (Botella and Lim, 1986), intuitively because the 
counterterms for the FCNC quark self energy and for the FCNC vertices of Z or 7 
gauge bosons are not independent. 

We first display the results for the 1-loop induced effective FCNC Z- and 7 - 
vertices (Inami and Lim, 1981): 


FsdZ 


1-sd-y 


a _ 9 

47 rsin 2 #iv cos6 w 


E {V£V iA )T Z [x i )(gL'1»d L ) Z», 

i=c y t 


a e 
47rsin 2 #w 2 Myf 


E (o^) 


(9.20) 


s[Fi(x i )(g 2 7 #1 - q^i)L + F 2 (x i )cr^iq' / {m s L + m d R)]d A* 1 , (9.21) 


where the effective sd'y vertex is given in the momentum space {q^ = p$ — pfa). The 
coefficient functions are given in terms of the interemediate up-type quark masses 


Xi = mf/My/ (i = u, c, t ) as 


r z {xi) = 

r 1 3 1. 3 2x? - Xi 

4 8 I| -l 1X, + 8(* i -l)* lna: ‘ + '' (Xi ' f) ' 

(9.22) 

Fi(xi) = 

„„ 1 1 , 13 1 1 1 

W “l2*i-1 + 12(xi-l) 2 2 {xi — l) 3 Xt 


+ 

r 2 1 2 1 5 1 11 

3 Xj — 1 3(xt-l) 2 6 (xj — l) 3 2 (x, — l) 4 Xl UXt 
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F2(Xi) 


+ 


7 1 13 1 1 1 

L 3xi-1 + I2(xi-l) 2 2 ( Xi — l) 3 Xi 

1 1 35 1 5 1 1 1 

6 Xi - 1 12 (xi - l) 2 6 (x{ - l) 3 + 2 ( Xi - l) 4 Xl nXl 

, 0 . 

1 1 3 1 3 1 3 xf 

“ 4xi - 1 + 4 (xi - l) 2 + 2 (xi - l) 3 ^ Xl 2 (xi - l) 4 

r 1 1 , 9 1 3 1 , _ 3 xl 

l 2 Xi - 1 + 4 (Xi - l) 2 + 2 ( Xi - l) sJ Xt 2 ( Xi - l) 4 


(9.23) 

Inij} 


(9.24) 


where the term ln(^-) in Fi (x ^) has an infra-red singularity at the limit x u , m u —> 0, 
and x u dependence has been kept only there. This term just corresponds to the 
contribution of “penguin-diagram” with gluon-exchange (see Fig. 9.6) discussed 
below (we may call the logarithmic contribution as the contribution of “electro- 
weak penguin” diagram). In Q denotes the electric charge of the internal 

up-type quarks, i.e. Q = |. We have left the factor Q free, so that we can clearly 
distinguish the contributions of the different types of diagrams where the photon is 
attached to the internal or external quarks, respectively. This makes, for instance, 
the calculation of sdg (g : gluon) vertex, namely the calculation of the gluon- 
penguin-diagram, really straightforward (Problem 9.2). The appearance of the f- 
dependent term 


7(*,0 = 


+ 


+ 


3 1 


1 1 
+ t- 


8 (£a-l) fx-l v 4a:-l 8£a-l 
r_( 5 i +l)s _ + 2 x - jln ^ 


-)a;ln x 


8 L «*-i)K-l) (^-1) 2J 


(9.25) 


implies that the effective vertices are not gauge invariant, though the Pauli-term 
accompanied by Fi is gauge invariant, since only this term contributes to the decay 
s —> dj with the on-shell condition q 2 = 0. The remaining gauge dependence is 
cancelled when the contributions from box diagrams are also summed up, as we will 
see below. 

Let us note that the sdj vertex has two pieces, the terms with the currents 
[s(ry 2 7 M - g M ^)d] and [sa, tv d)q v , called “charge radius” and “Pauli” terms, respec¬ 
tively. These currents automatically vanish (without the use of equations of motion), 
when multiplied by , thus satisfying the current conservation of electromagnetic 
current (CVC). Note that in the FCNC vertices the current conservation is not 
satisfied by use of the equations of motion, since s and d have different masses, but 
satisfied by these specific forms of the currents including the external momentum 
. Thus the sd'-f vertex vanish in the limit of =0, and we need to calculate the 
diagram keeping the 0 (q 2 ) terms, in contrast to the case of calculations of other 
FCNC effective Lagrangians. This proportionality to q 2 makes the non-decoupling 
effect of heavy t quark rather mild in the sdj vertex, i.e. In (rrit/Mw), while in 





226 


FLAVOR PHYSICS AND CP VIOLATION 



Fig. 9.5 


sdZ vertex we have an non-decoupling effect proportional to rrif , just as in the box 
diagram (see (9.22)-(9.24)). The reason is rather easy to see; in the sdZ vertex 
the ip + -exchange gives the Yukawa coupling factor m t /Mw twice, thus giving the 
non-decoupling effect proportional to mj, while in the sdj vertex, the additional 
q 2 factor enforces the remaining piece to have mass-dimension d = — 2 , behaving as 
1/mf, thus cancelling the rrij from the Yukawa couplings. In this way, when the 
FCNC 7 vertex or similarly FCNC gluon vertex is relevant, the contributions from 
the lighter u.c quarks become significant. Especially the penguin-diagram shown 
in Fig. 9.6 with FCNC gluon vertex is expected to play an important role in the 
explanation of “A/ = | rule” in K —> tttt decays. 

Write the effective 4-Fermi Lagrangian relevant for the decays of our interest as 

4/? =1 = 4?rs dw E (V'V^iCix^sZ^iwrPL) 

3 

- (9-26) 

j =i 

mf. 

where yj = with m/ ; . denoting the charged lepton mass of the j-th generation. 
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Fig. 9.6 


The coefficient functions are the sum of the contributions from the box diagrams 
and the Z-exchange diagrams in Fig. 9.4: 


C{xi) = Ca(xi) + Cz(xi), D(xi,ijj) = D a (xi,yj) + D z (xi). (9.27) 


We readily know that Cz(xi) = Dz(xi) = F z (xi). The contributions from the box 
diagrams are known to be 


Cb(®i) = 


Du{xi,yj) - 


+ 


3 Xi ^ 3 Xi 
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1 1 ,yj - 4. 2 . 
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(9.28) 


(9.29) 


Combining these two types of contributions we finally get gauge invariant coefficient 
functions 


s-i / , 3 , Xi ,0 . 13 Xi 

C(Xi) = 4 ( i7^T> lnIi + 4 I ‘-4^' 


1 r X 

+ ol 


Xi 4,2 


) 2 + 1 + 3 


8 yj - Xi Xi - 1 

1 3, 1 , Xj 

+ 7 x i — Q (1 + 3-~)-r 

4 8 yj — 1 Xi — 1 


(Xi - l ) 2 


\xAnxi 


(9.30) 


(9.31) 


Note that the piece with vector current does not contributes to the Kl —> 

p/2 decay. This is because when q u coming from the hadronic matrix element 
(Olsi^dtl-Kz,) = — |(0|s7 M 75d|FfL) is multiplied to the leptonic current, the CVC 
implies = 0. Thus the photon-exchange diagram in Fig. 9.4 does not 

contribute to the process Kl —> p/2, as the matter of fact. 
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Fig. 9.7 


We thus obtain formulae for the ratios of the branching ratios of two processes to 
those of reference decay processes, in terms of the coefficient functions given above, 

Br(K L —> fip)sd t(K l ) a 2 [Re(X\ =c , t V^C^))] 2 

Br(K+ - fivj ~ t(K+) Urrsin 2 9 W ’ V* s ’ 1 j 

Br(K + -* t x + u9) _ a 2 ^ \'Ei =c ,t V i* s v idD(x i ,y j )\‘ 2 , . 

Br(K + —* TT°e + v e ) l 47rsin 2 y (K s ) 2 ’ ’ 


where the subscript “sd” denotes the short distance (or perturbative) contribution 
to the process, without including the two-photon process discussed below, and in 
the estimation of Kl —» fip the difference of phase spaces of Kl and K + decays 
have been ignored. In the case of Kl -a fj.fi, there is an additional contribution from 
FCNC two photon vertex, as shown in Fig. 9.7, whose presence makes the precise 
prediction of the branching ratio difficult, though including the two photon process 
the above formula gives a reasonable prediction, compatible with the observed value 
Br(K L -> pp) = (7.25 ± 0.16) x 10~ 9 . 

On the other hand, the branching ratio Br(K + — > Ti + viJ) can be reliably esti¬ 
mated, as the photon process does not contribute to this decay. When the values of 
quark masses and generation mixing angles (or KM matrix elements) are substituted 
and an approximation yj = 0 is used, i.e., 


D(xi) = D(xi,0) = - 


3 Xi — 2 Xi 3 Xi tn oa\ 

7 7-777 xMxi + — + ---, (9.34) 

4 (Xi - l) 2 4 4 Xi - 1 


the above formula gives Br(K + — > tt + uD) of the order of 0(1O -10 ), which seems 
to be quite consistent with the recent result of BNL experiment, which has claimed 
the observation of the rare decay. 

We learn from (9.7) that K° or K° is the mixture of K\ and K 2 , or more 
precisely Ks and Kl, with mass difference Amu due to the K° <-» K° mixing. 
Hence, if we can prepare an initial state of K° or K°, it’s existence probability 
will oscillate in time due to the interference effects of matter waves of Ks and Kl, 
just as the neutrino oscillation discussed in chapter 6. A feature not shared by the 
neutrino oscillation is that neutral kaons decay into tttt etc., when they propagate. 
Thus the time evolution Hamiltonian have (effective) imaginary or non-hermitian 
part to effectively represent the decay. In order to accommodate the absorptive 
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part and possible CP violating effects we correct (9.6) as 

( M-|r m 12 - fr 12 \ 

VM x * 2 -in 2 M-|r 


(9.35) 


where M, T, corresponding to the mass and the decay width of neutral mesons in the 
absence of the mixing, are real, while M i2 and r ]2 may have imaginary parts due 
to CP violation. Though this Hamiltonian is applicable to both of neutral kaon and 
neutral meson system {Bd ~ 675 ei, B d ~ (J 756 ; B s ~ bj 5 s, B s ~ s%b), here we 
will discuss the neutral B meson oscillation. (Here we discuss only (B d , B d ) system, 
and will simply use the abbreviation B° for B d , unless otherwise mentioned.) One 
reason for this is that in the neutral kaon system the lifetimes of Kl and A '5 are 
too different for the interference to be operative, while in the B meson system the 
difference of lifetimes due to IT 2 is very small. 

The eigenstates of the Hamiltonian 


\Bi) = 

vTpF+w <p|b0> ~ 

(9.36) 

\B2) = 

|p| 2 + |g| 2 B 

(9.37) 


where 

£ 

P 

are known to have eigenvalues 
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■ir** 
2 1 12 
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M12 — £r 12 


(9.38) 


with 


1 

Ai, 2 = m i i2 — - ri )2 , 


(9.39) 

AA = A 2 — Ai = Am — 

i Ar 


(Am = m 2 —mi, AT = 

r 2 -ro, 

(9.40) 

- l -AT = 2 M 12 - l -T 12 i 


(9.41) 


The time evolution of a state ip(t) = c(t) \B°)+c(t) \B°) is governed by a Schrodinger 
equation 



(9.42) 
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where the Hamiltonian H has been given in (9.35). The diagonalization of the 
Hamiltonian 
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enables us to solve the differential equation as 
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(9.43) 


(9.44) 


(9.45) 


Let |H(t)) and |B(t)) be the states with the initial conditions |H(0)) = |jB°), 
|H(0)) = |H°). As |B°), e.g. corresponds to c( 0 ) = 1,6(0) = 0 we easily find 


I B(t)) = g + (t)\B 0 )- V(t)|H°>, (9.46) 

r 

\B(t)) = g+ {t)\B°)-Zg-(t)\B°). (9.47) 


The “survival” and transition probabilities for a state starting as B° at time 0 to 
be found as B° and B° at time t are easily found to be 

P(B°-+B°) = |(B°|H(t ))| 2 = | ff+ (t)| 2 ~e- r W(^t), (9.48) 

P[B° -> B°) = |(B0|B(t ))| 2 = |^_(t)| 2 ^l^| 2 e- n sin 2 (“t)- (9.49) 

where the specific property of neutral B meson system AT <C Ti^, Ti~r 2 ~r 
has been used. Note that in neutral kaon system AT ~ T, which causes very 
different life-times for the two eigenstates, Kl, Ks- Why does such difference in 
two systems appear? In the B meson system, the generation mixings of t quark 
with lower generations are not suppressed compared with the mixing with the 3rd 
generation, 1 ^ 14^1 IV^bcdl ~ \V* b V t d\, as we will see below. On the other hand 
the, Am, AT, produced by the dispersive and absorptive parts of box diagrams, 
M 12 , r 12 , as shown in Fig. 9.8, get contributions from intermediate t and u,c 
quarks, respectively. 

In the case of absorptive part only u,c intermediate states are kinematically 

allowed to be on-shell. In the case of dispersive part all of u,c,t contribute, in 

principle. We, however, know that the coefficient function E{xi) of (9.13) grows 
2 

up as ~\xi — (non-decoupling effect) for 'J>> 1 and the t quark gives the 
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Fig. 9.8 


dominant contribution. Thus the absorptive part is suppressed compared with the 
dispersive part by 0(ml/mf) <C 1. This is the reason why Ar is negligible in the B 
system. The formulae (9.48), (9.49) remind us of those in neutrino oscillation (with 
maximal mixing). One remarkable difference is that in the B° B° oscillation, 
the sum of the probabilities decrease as the time goes on, which is the consequence 
of the loss of the unitarity due to the non-hermitian Hamiltonian. So, when we 
study observables sensitive to the oscillation, such as CP asymmetries in B°, B° 
decays, Am x r = ^ = x needs to be sizable, which is satisfied in B[ ], system, 

where x is known to be ~ 0.7. (For the £?°, B® system, x is much larger, relatively 
enhanced by \V ts /V t d\ 2 -) 

We may define the ratio of the time-integrated transition and survival probabil¬ 
ities 


_ / 0 °° P{B° B°)dt x 2 
r = Jo°° P ( B ° -* B °) dt ~ 2 + x2 ’ 

where a formula 


(9.50) 


e rt cos(A mt) dt = ^ (x = ^), (9.51) 

has been used. By a reason stated below we have also set ||| = 1. In the e + e - 
collider experiments, the ratio r can be measured by observing a ratio 


Ni+i+ + Ni~i~ 

N l+i - 


(~ 2 r for small r and assuming CP invariance), 


(9.52) 


namely the ratio of the event rates of the same sign leptons to that of the opposite 
sign leptons. In the collision B°, B° are pair created, but some of them may 
experience oscillation, so that there appear a possibility that both of mesons become 
identical, producing charged leptons as their decay products with the same sign, via 
processes b —► c + l~ + i/i or 6 — > c + l + + vi. Historically such experiments first gave 
an useful information on Am, which in turn was used to impose the lower bound 
on mt- 

This mixing due to Am or the box diagram is a FCNC process of |A£?| = 2 
with B denoting “bottom number”. We may think of a CP violating or T violating 
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observable in this |AjB| = 2 mixing process in mass matrix, sometimes called as 
“indirect CP violation” (in contrast to “direct CP violation” in the decay amplitude 
of B meson itself). Namely the event rate difference of the same sign di-lepton events 
| N[+i+ - Ni-i -1 will signal a difference P(B° -> B°) - P(B° -> J5°), which is a clear 
signal of CP or T violation. (9.46), (9.47) tells us that the difference is possible 
only when 

(9-53) 

p q p 

Unfortunately, this deviation of |^| from 1 is quite small, and to see the CP violating 
effect is not plausible. The reason is the following. The key ingredient to understand 
the situation is in (9.38). There we see that if the Mi 2 and P r 2 have the same phase 
factor, |^| = 1 , even though £ itself has a large phase factor (, which produces 
large CP asymmetries in B decays as we will see later). As we see in Fig. 9.8, the 
absorptive part Ti 2 is due to the contributions of u,c quarks. In the hypothetical 
limit m u = m c , the orthogonality of the K-M matrix V* b V u d + V* b V c d = - V* b V t( i 
implies that M 12 , handled by t quark, and Pi 2 , handles by u,c quarks have the 
same phase factor, leading to ||| = 1. We thus learn the deviation of ||| from 1 
is strongly suppressed by a factor (m 2 c - rri 2 u ) /rnj. The difference m 2 c — m 2 u was 
compared with m 2 , since in the limit of T 12 /M 12 —> 0, again ||| = 1 is realized. 


9.4 CP Violation in Kobayashi-Maskawa Model 

The 3 generation Kobayashi-Maskawa model was proposed to account for the ob¬ 
served CP violation. As we have briefly discussed in the section 9.1, in this model 
CP violation is closely related with FCNC processes. In other words, the CP vio¬ 
lation needs the breaking of the global “horizontal” or generation symmetry SU (3) 
(n g = 3), which is also needed in FCNC. To see this, suppose m u = m c . Then, 
SU( 2) sub-group of SU (3) in the up-type quark sector, i.e. the symmetry between 
u and c quarks becomes an exact symmetry. By use of this 517(2) symmetry, it 
is always possible to perform a suitable unitary transformation, belonging to the 
SU( 2 ), so that V u d vanishes, while we can always make the 1 st column and the 1 st 
row of the KM matrix real by suitable re-phasing of quark fields q -> e l<t> q. Then, 
it is easy to show (see Problem 9.4) that we can perform a re-phasing such that the 
all matrix elements become real. We learn from this example that if there is a mass 
degeneracy in either of up- or down-type quark sector, CP violation never happens 
(except for the possible strong CP violation, which we ignore). 

Thus there arises a natural question: what is the (necessary and sufficient) 
condition for the CP violation in the Kobayashi-Maskawa model? In particular we 
ask what is the re-phasing invariant quantity which characterizes the CP violation. 
The importance of the re-phasing invariance should be stressed here. In (4.181), the 
CP violating phase S appears in the matrix element of, e.g. Vu, while in (4.183) the 
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Fig. 9.9 


w+ 



matrix element is real. This is a little puzzling, since if we adopt (4.181) it seems 
a process with t —> b transition has CP violating effect, while if we adopt (4.183) 
there seems to be no CP violation in the process. Actually the CP violating phase 
of (4.181) can be eliminated by a suitable re-phasing of t,b. As the physics should 
be the same, irrespectively of the re-phasing, this means it is meaningless to talk 
about such re-phasing variant quantity Vtb to judge whether the CP symmetry is 
violated or not, and we should seek a re-phasing invariant quantity, which is due 
to the interference of different KM matrix elements. Let us now come back to the 
condition for the CP violation. We first note that any observables related to quarks 
are represented by the cuts of Feynman diagrams with closed fermion loops. In fact 
if a transition amplitude A is represented by a diagram G, its complex conjugation 
A* is represented by a diagram G* where the directions of the arrows of all fermion 
lines are reversed, as the complex conjugation implies the charge conjugation. Thus 
\A\ 2 = AA* is represented by combining G and G *, namely by the cut of closed 
fermion loop (see Fig. 9.9 for the example of W + —> ud decay). 

In the language of weak-eigenstates, the origin of the CP violation is attributed 
to the complex Yukawa couplings or mass matrices of up- and down-type quarks, 
M u , Md, inserted into the fermion line of the closed loop (see Fig. 9.10). 


~Pu.U.c. 7feo.fAcmnLtc.af ~Ph.y^Alc.A. 



234 


FLAVOR PHYSICS AND CP VIOLATION 


Actually M u , M d appear in the loop diagram in the combination of hermitian 
matrices H u = M U M*, H d = M d M d , as the right handed quarks have no charged 
weak current interaction. Thus if 

Im I 'r[P 1 (H d )Pi(H u )P 2 (H d )P^(H u ) • • • P' (#„))] # 0, (9.54) 

for some monomials of H Utd , Pi(H d ), P{(H U ) etc., it is the indication of CP 
violation. In the case of n = 1, we easily find that ImTr[Pi {H d )P[(H u )\ = 
0, as {Tr[Pi(H d )P[(H u )]}* = T 'v[P 1 (H d )P{(H u )^ = Tr [P[{H u )P x {H d )} = 
Tv[Pi(H d )P{(H u )}. The first non-zero imaginary part appear at n = 2: 

Im T Y[H d H u H 2 d Hl\. (9.55) 

It can be shown that there appear no more independent imaginary part (Gronau, 
Kfir and Loewy, 1986). On the other hand some arithmetic shows that 
Im Ti[H d H u HjHl\ ? 0 <-> det [H u ,H d \ ± 0. Thus we find the condition for 
the CP violation is 

det [M u Mt , M d M* d ] £ 0. (9.56) 

By use of the fact H u = U\ L dia (ml,m 2 c ,m 2 ) U u l etc. (U u l is the unitary matrix 
used to diagonalize the up-type quark mass matrix), we can show that the above 
condition is equivalent to (see Problem 9.5) 

(ml - m 2 c )(ml - - m 2 a )(m 2 d ~ m 2 b ){m 2 s - ml) xj ^ 0, (9.57) 

where J is the so-called Jarlskog parameter (Jarlskog, 1985), defined by 

J = |Im {ViaV? a VjpV* 0 )\ = sm 2 0 isin 02 sin 0 3 cos 0 icos 02 cos 03 sin <5 

~ A 2 X 5 ri = O(10 -6 ) (i,j = u,c,t;a,/3 = d,s,b\i ± j,a 0), (9.58) 

where A, A, rj are parameters in Wolfenstein’s parameterization of KM matrix (see 
(4.184)). As we will see below, even though there seem to be 9 choices of the 
combination of i,j,a,(3, the imaginary part is unique, up to its sign, corresponding 
to the fact that in 3 generation model there remains only one CP-violating phase. 
We take J to be a positive value. As we anticipated, (9.57) clearly shows that if 
there is a mass degeneracy in either of up- and down-type quark sector, such as 
rn u = m c or m d = m s , CP violation never happens. 

Let us also note that J is invariant under re-phasing of quark fields, iq —> 
e i ^ >i m (ui = u,c,t), d a —>■ e t<t>a d a (d a = d, s, b). In fact under the re-phasing, 
Vi a -> ed-^+^Via and J e d-<l>i+<l>A e K<t>i-4 a ) e K-<t>i+M e i{4>i-<i>f>) j = j. 

The invariance comes from the fact that we have been considering the closed loop 
of fermion, where the creation and the annihilation of all quarks happen in pair. 
The uniqueness of the J parameter is easily understood noting the orthogonality of 
the unitary matrix V. For instance 

Im (V id v; d v js v*) = -Im [v id v; d v jd v* d + v id v; d v jb v; h ] - -Im (V id V* d V ]b V* b ). 

(9.59) 
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Fig. 9.11 

The origin of the J-parameter can be understood diagrammatically. Let us work 
in mass-eigenstates of quarks. Then the origin of the CP violation is attributed to 
the CP phase of KM matrix appearing in the vertices of charged currents. We may 
first think of a diagram with two vertices of charged current where W + and W~ 
are attached, Fig. 9.11(a). 

This diagram does not yield an imaginary part, since Im (Vi a V* a ) = 0. 
Next candidate is the diagram given in Fig. 9.11(b), yielding a non-vanishing 
imaginary part Im( V* a VjpV*^), which is nothing but the J parameter, up 
to the possible sign difference. We can confirm that there appears no more 
independent imaginary part, even if additional pairs of W + and W~ are at¬ 
tached; for instance, Im(V ia V* a V j0 V* 0 V k ^V*) = -\V ia \ 2 Im(V* a V j0 V k * 0 V ka ) - 
\V k0 \ 2 lm{V ia V* a V j0 V* 0 ) (i j ± k ± i, a ^ P ± 7 ^ a). 

Suppose the i-th element of a-th and /3-th column vectors (a ^ fi) have relative 
phase factor Vi a /|Vi a | = ±e i<t> (V^g/IV^I) or arg {Vi a V* 0 ) = </>, irrespectively of the 
choice of i. In this case by a suitable re-phasing, e.g. d a —> e~ l( ^ d a , we can make 
all of Vi a V*p real, eliminating the relative phase. If suitable re-phasing for each of 
ut is further performed, both of a-th and /3-th column vectors can be made real, 
leading to the reality of the third column vector (by a re-phasing) as well, implied 
by the orthogonality of the column vectors. We thus conclude that, in this case, 
there should not be any CP violating phenomena. Let us note that if these three 
complex numbers Vi a V*p (i = 1 , 2 ,3) are drawn as three vectors in a complex plane, 
every vectors are parallel or anti-parallel to each other. On the other hand the 
unitarily of the K-M matrix ViaV*p = 0 means that the three vectors form a 
closed triangle, though the two inner angles vanish, or the triangle is a closed one. 

This argument suggests that the CP violation is closely related with the inner 
angles of such triangles, called “unitarity triangles”. Namely, three independent 
orthogonality conditions JT Vi a V*g = 0 (a / /3) yield three independent unitarity 
triangles, depending on the choice of the pair (a,/3), as shown in Fig. 9.12. 

As we will see below, CP asymmetries, such as those discussed in the neutral 
B meson system, correspond to sin 29, with 0 being a relevant inner angle of an 
unitarity triangle. The reason of the necessity of the factor 2 in the argument 28 
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Fig. 9.12 



Fig. 9.13 

is easily understood; when the triangle is closed the inner angles are either of 0, 
| or 7 r, which means sin 20 = 0 for all inner angles. Let us note that now the 
re-phasing invariance of CP violating observables are manifest; though the each 
side of the unitarity triangle rotates under the re-phasing, the shape of the triangle 
is manifestly invariant, so are the inner angles. The shape itself clearly changes 
depending on the choice of (a,/3), while we know every CP violating observable is 
described by an unique parameter, the Jarlskog parameter J. What happens is, 
although we have several different triangles depending on ( a, ft ), their area S is 
known to be unique and just the half of J: 

j = |im {v ia v; a v jP v; 0 )\ = \v ia v; 0 \\v ja v; 0 \ sin = 25 , (9.eo) 

where Oij is the angle between Vi a V* 0 and Vj a V ^ (0{j < n). As long as the area S 
is unique, the CP symmetry will get large in the system whose triangle has three 
sizes with comparable lengths, as in the case of B meson system (a = b, (3 = d) we 
will see later, while if only one of the sizes is very small the inner angles will be near 
to 0 or |, and the CP asymmetry will get small, as is the case of K meson system 
(a = s,P — d ). See Fig. 9.13. 

So far our argument has been focused on the CP phase in the KM matrix and the 
resultant Jarlskog parameter. We have also seen that for CP to be really broken the 
non-degeneracy of quarks masses in both of the up-type and down type sectors is 
needed. But, actually we still need an additional condition for the CP violation: the 
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presence of the absorptive part in the relevant transition amplitude (the cut in the 
Feynman diagram). In fact, in the end of the section 9.3, we have already seen that 
r 12 = 0 leads to a vanishing CP violating effect in B meson system. The necessity 
of the absorptive part is seen from the following illustrative argument. Suppose the 
transition amplitude of a process A gets contributions form two different Feynman 
diagrams A = Ai + A 2 . We will decompose the each amplitude to the part which 
possibly contains the CP phase 5 due to the weak interaction, and a phase factor 
indicating the possible presence of the absorptive part due to the on-shell property 
of the intermediate state: A 1>2 = A wl<2 e lSl •*, A = A w i e lSl -I- A w 2 e t52 . The 
transition amplitude of the process between the CP-conjugated states is written as 
A( cp ) = A* wl e lSl + A* w 2 e t52 . Let us note the phases <5i ]2 do not change their sings, 
since they have dynamical origin and have nothing to do with the KM CP phase. 
The CP asymmetry proportional to the difference of squared absolute values of A 
and now reads as 

\A\ 2 - | A( CP ^ | 2 = 4sin (6 1 - S 2 ) Im(4S,i^» 2 )- (9-61) 

Here we clearly see that to get CP violation as a physical observable, in addition 
to the CP phase contributing to Im (A* wl A w2 ), the presence of the absorptive parts 
denoted by 5\ t2 is needed. We also see the interference of multiple diagrams, such 
as A* u] A w2 , is inevitable to get the CP violating observable, as suggested by the 
re-phasing invariance: the CP phase in a single diagram can be always rotated 
away by a suitable re-phasing of an external quark field. Thus a tree diagram 
alone never produces any CP violation. We will see below that the possible large 
CP asymmetries in B system of 0(10%) is due to the interference of the decay 
amplitudes of the B meson and the amplitude for B B mixing, namely the 
interference between A B = 1 and A B = 2 amplitudes. 


9.5 CP Violation in the Neutral K System 

The CP violation had been observed only in the system of neutral kaon system 
( K °, K°), although the B-factory experiments in KEK and SLAC, recently started, 
have begun to release the data on the CP asymmetries in the decay processes of 
neutral B meson system In fact, historically the first confirmation of the 

CP violation was made by the detection of the long-lived neutral kaon Kl —> tttt. 
In this section we focus on the CP violation in the kaon system, as it is still playing 
an important and complimentary roles in the study of CP violation. 

As has been already mentioned above, the CP asymmetry in the neutral kaon 
system turns out to be small. This comes from the fact | Vt s V t * d | <C |Fu S — 
\v cs v; d \ (\V ts V t * d \ = 0(A 2 A 5 ), \V us V* d \ = 0(A)). The CP asymmetry in the kaon 
system is thus expected to be 0(A 2 A 4 ) = 0(1O -3 ), which has been confirmed 
experimentally. 

Our main concern is the aforementioned Kl —» irn decay. In the kaon system, 
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in clear contrast to the B meson system, the life times of Kl and Ks are quite dif¬ 
ferent, as suggested by the subscripts L and S, thus making to isolate the Kl state 
possible. The decay of the Kl state is a clear signature of the CP violation. The 
state Kl is determined as an eigenstate of the 2x2 Hamiltonian of the (K°, K°) 
system. If there is no CP violating effect in the Hamiltonian, the eigenstates should 
be the eigenstates of CP transformation, i.e. Kl = K 2 (see (9.7), (9.9)). On 
the other hand, the K 2 state is odd under the CP transformation, while the final 
state of the decay, the tttt system, is even under the transformation. This simple 
argument shows the decay Kl -A rnr never happens, provided the theory is CP 
invariant. Therefore, its detection should be a clear signature of the CP violation 
in the theory. This argument also suggests that actually such CP violation decay 
may have two distinct sources of CP violation, i.e. “indirect CP violation” in the 
Hamiltonian of the (K°, K°), and the “direct CP violation” in the dacay amplitude 
K 2 -> 7T7T. We will discuss these issues successively below. 


“Indirect CP violation” 

The indirect CP violation in the Hamiltonian of the ( K °, K°) is due to the imag¬ 
inary parts of the amplitude of |AS| = 2 mixing process K° «-> K°, denoted by Mu 
and r 12 • As the expected CP asymmetry in the kaon system is very small, 0(1O -3 ), 
we expect ImMu 4C ReM 12 etc., and the deviation of the ratio of coefficients of two 
neutral kaon states qn/pK (see (9.36) - (9.38) from unity is small. Thus, writing 
PkIqk = (1 - e)/(l + e), the two eigenstates of the Hamiltonian are re-written as 

\Kl) = \^= [{l + e)\K°) + {l-€)\K<>)] 

V2(1 + N 2 ) 

- 7raP i)+wl <9 - 62) 

|A's) = , [(1 + W°) - (1 - «)|g°)l 

= 7wm []K ' )+im] - <9 - 63) 

We learn that non-vanishing e indicates Kl, Kg are not pure eigenstates of CP, im¬ 
plying CP symmetry is violated in the Hamiltonian. As Im^glCj) = —Im(Vt s V^) 
(see (4.183) and (4.184)) and the contributions of the intermediate quarks in the 
box daigram behave as E(x c )/E(xt) ~ m 2 c /m% <C 1 (see (9.13)), we conclude 
ImPi 2 ImMi 2 . The experimental values tell us = Tl — Ts(— -Ts) = 

2 T12 ~ -2A nriK = —4M 12 , i.e. Ti 2 ~ —2Mi 2 . We thus get a relation from (9.38) 


e<* .ImMi 2 . 

t=: 7 f 


(9.64) 


Let us note that e cannot be physical observable, since it is re-phasing variant: 
under the U( 1) global transformation concerning strangeness S, s -> e”As or K° -> 
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u 


Fig. 9.14 

e-^K 0 , K° —i e^K 0 , giL (= f=4) —> e~ 2t ^ SJC . Or, for infinitesimal transformation 
\cj)\ <§; 1, e —> l — i(f>. Thus, though Ree ~ |(1 — |y=||) is re-phasing invariant, Ime 
is not re-phasing invariant. 

The re-phasing invariant Ree has been measured by the observation of the 
“charge asymmetry” 

F(K L -» 7 r-l+vt) - r (K l -» n+l-Ot) \p K \ 2 - \q K | 2 

L ~ T(K L -H'-l+v l ) + T(K L ->r+l-ih) \ PK \ 2 + \qK? 

~ 1 - |^-|~2Ree= (3.30 ± 0.12) x 10~ 3 , (9.65) 

PK 

which clearly vanishes if CP is conserved, as it~l + vi and 7 r + l~vi are CP conju¬ 
gate states of one another. As we expected, the observed CP asymmetry is of 
C>(10~ 3 ). This charge asymmetry is equivalent to the non-orthogonality of Ki,Ks : 

Sl = ( K l \K s ). 

Including “direct CP violation” 

CP violating effect may “directly” appear in the amplitude of K°,K° —> 7 T 7 t 
themselves. As a pion has an isospin 1 = 1, the final state 7 r 7 r has either I = 0 or 2 
(1=1 contradicts with Bose statistics of scalar fields). Write the amplitudes of the 
decay to these two possible isospin states as 

A(K° ->• 7 T 7 r(/)) = e iSl A t (I = 0, 2 ), (9.66) 

where Ai are due to the weak interaction, including possible CP phase, while <5/ 
denote the absorptive parts coming from the strong (final state) interactions. The 
corresponding amplitudes for K° decays are obtained by replacing Aj by —A}, 
without changing the signs of 6/. The Aq gets a remarkable contribution from the 
“penguin” contribution with gluon-exchange (see Fig. 9.6), which is expected to 
be responsible for the relative enhancement of the 1 = 0 amplitude compared with 
I = 2 amplitude in the K -k-k decays, ~ 22, so called A I = | rule, while 

the A 2 gets the main contribution from the ordinary charged current process shown 
in Fig. 9.14, though its CP violating imaginary part may be provided by “electro- 
weak penguin” diagram, which is obtained by replacing the gluon by 7 or Z in the 
penguin diagram. 

The final state 7 T 7 t is either 7 r + 7 r“ or 7 r° 7 r°. Write the corresponding measures of 
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CP violation as 


V+- 


Voo 


A(Kl —t 7T + 7T ) _ , 

4(^5 -»• 7T+7T-) =£ + e 

A(K l -> 7r°7r°) _ , 

A(AT S ->• 7r°7r°) ~ 


(9.67) 

(9.68) 


in terms of two parameters e and e', which denote the part common for two kinds 
of decay and the difference of the amplitudes of the decays, respectively. By use of 
a relation 


7r + 7T ) = MI = 0)) + M/ = 2)) (9.69) 

M 0 ) = MI = °)) + \/f M 7 = 2 ))> ( 9 - 7 °) 


and A(Kl tttt)/A(Ks —> ttk) ~ (eA(IG —»• 7T7 t) + A(If 2 —> ttk))/A(Ki mr) = 
e + (A (A -> tttt)/A(Ki —> 7T7r)), we easily get the following approximate relation 


_ . ,1mA). 

- t+, W 

V2 A 


j c i(So-St)/ Re A ., Im A 2 _ ImA 0 
\/2 ReA 0 ReA 2 ReAo 


(9.71) 
), (9-72) 


where we have used the facts A is sufficiently smaller than A in both in their real 
part (AI =1/2 rule) and in their imaginary part (electromagnetic penguin contri¬ 
bution is suppressed compared to ordinary penguin contribution by the difference 
of gauge couplings). As Kl and tt are physical states, the decay amplitudes, and 
therefore e, e', should be re-phasing invariant. In fact, neglecting A, 


£ = A(K L -» M _ 1 ~ _ Irj _ 0 

6 A(K S -> 7T7r) i _i_ £e- 4a ~~ 2 pk A 0 

Pk Aq 


As -4 e 2l< ^ and Ao —* e “^A under the strangeness t/(l) transformation, 

the combination is re-phasing invariant. The re-phasing invariance of e' is 

trivial. 

In the KM model, is less important compared with e, and e ~ e. Thus in 
terms of the coefficient function E(x) obtained from the box diagram, 


-e<‘ 


oGf BM K f\ 

247rsin 2 ^w ArriK 


mt 


im{(v; s v td ) 2 } MM-), 


Ml 


(9.74) 


though additional factor coming from perturbative QCD correction of 0(1) should 
also be multiplied. The experimental value (Particle Data Group, 2002) 


|e| = (2.282 ±0.017) x HT 3 , 


(9.75) 
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is useful to restrict the shape of the unitarity triangle, or on (1 - p)r] (in terms 
of Wolfenstein’s parameterization) from the factor lm{(V t *V t d) 2 }, since we now 
know mt — 175GeV. The prediction of another CP violating variable e' is less 
reliable, since we have two competing terms p^ 2 and . Though |ImA 2 | <SC 
|ImA 0 |, the A I = 1/2 rule relatively enhances the importance of Im A 2 , which gets 
a contribution from the electro-weak penguin diagram. As the coefficient function 
of the FCNC sdZ vertex, in (9.20) grows roughly in proportion to m\ for 

larger m t , the non-decoupling contribution of the intermediate heavy t quark tends 
to cancel the effect of p^ a coming from ordinary penguin diagram, thus making the 

prediction of e' rather smaller than naively expected, i.e. |y | = 0(1O -4 ) to 0(1O~ 3 ), 
while the present experimental data (Particle Data Group, 2002), 

Re f ^ sa ^ = (1.8 ± 0.4) x 10" 3 , (9.76) 

seem to support this expectation. In accordance with our general argument, the CP 
violating observables e, e' result from the interference effects of different amplitudes, 
e.g. the interference of A 0 and A 2 in the case of e'. We may summarize this section 
by classifying the CP violating observables depending on where they have their 
origin, i.e. in the A S = 2 “indirect” CP violation in the mass matrix, or in the 
AS = 1 “direct” CP violation in the decay amplitudes or in the interference of these 
two types: 

• pure AS = 2 CP violation: Re e (Sl) 

■ interference between AS = 2 and AS =1: Im e 

• pure AS = 1 CP violation: e'. 


9.6 CP Violation in the Neutral B System 

In the KM model, as long as there are three independent unitarity triangles de¬ 
pending on the choice of (a,/?) with a,/3 = d,s,b, there should be CP violating 
phenomena, not only in the K system (a = d, = s), but also in the neutral B 
meson systems, By, (a = d, = b ) and B s (a = s,(3 = b). Hence, the observa¬ 
tions of the CP violating events, in the rates predicted by the KM model will be 
very useful for the final confirmation of the model, as the model for explaining CP 
violation, or by chance we may even be able to find some deviation from the pre¬ 
diction, which then will be an indication of some “new physics”. In addition, as 
was discussed in section 9.4, we expect large CP asymmetries, of 0(10%), in the 
Bd system, stemming from the fact that the three sizes of the unitarity triangle for 
(a = d, P = b) are all of comparable lengths, 0(AX 3 ). These are main motivations 
for currently on-going B-factory experiments at KEK and SLAC. In this section we 
mainly discuss the Bd system, and we will simply use the abbreviation B° for Bd, 
unless otherwise mentioned. 
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Although the origin of the CP violation in the B system in KM model is the 
same as in the K system, i.e. the violation of flavor symmetry and the CP violating 
J-parameter, there are a few characteristic features of the B system listed below, 
which requires some sophisticated observational method of the CP asymmetries in 
the system, a bit different from that in the K system: 

a. In the B system, since AP -C Am ~ T ~ ml/mf), the two eigenstates 

of the Hamiltonin, Bi, B 2 defined in (9.36), (9.37), have almost the same life time. 
Thus, in contrast to the case of kaon system, we cannot extract pure B 2 state, 
corresponding to Kl: the observation of CP violation by use of the decays of B\ 
or B 2 is practically impossible, unless one of these states has been prepared as an 
initial state. 

b. In the collision experiments such as those in the B-factories, the initial states 
should be either B° or B°, the (coherent) admixtures of B\, B 2 states, and we 
expect the flavor oscillation B° +4 B°, as was discussed in 9.3, which is similar to 
the neutrino oscillations, except the B mesons decay as the time goes by with the 
almost equal life time r = 1/r. If the oscillation in the duration of the lifetime is 
too rapid, i.e. if x = Am/T = Am r » 1, we in practice cannot distinguish B° and 
B° states, and the CP asymmetries, sensitive to the difference of these two states, 
will be washed out, no matter which initial state we prepare. Fortunately, the ob¬ 
tained value of x from the observation of the same sign leptons in e + e” collisions 
(see (9.50)) is in a suitable range in order to avoid such washing out, x ~ 0.7. 

c. Although we generally expect the CP asymmetry in the B system is quite large, 
the pure A B = 2 CP asymmetry, oc P(B° -A B°) — P(B° -» B°), is much smaller 
than the naive expectation, being suppressed by ||| — 1 ~ (see the argument 
below (9.53)). We also note the rates of “direct” CP violation in the pure AB = 1 
decay amplitudes cannot be predicted with accuracy: the (partial) decay rate asym¬ 
metries oc r(B° —> /) — r(B° —> /) (/, / : distinct final states) need the interfer¬ 
ence among different amplitudes, which have absorptive parts with different phases, 
whose estimation is not easy especially for those caused by strong interactions. 

d. Thus the remaining candidate to see the sizable CP asymmetries with reli¬ 
able predictions, is the CP asymmetries caused by the interference effect between 
AB = 2 and AB = 1 amplitudes, corresponding to Im e in the K system. In fact, 
we will see below the CP asymmetry is due to 


Im(^° 


/) 


p A(B° -> f) 


), 


(9.77) 


which corresponds to -21m e in K system (see (9.73)). Such type of CP asymmetries 
is known to be of 0(10%), as was first pointed out by Sanda and collaborators 
(Bigi and Sanda, 2000). Another nice thing of this type of CP asymmetry is the 
possible phase due to strong interaction exactly cancelled out, as long as we choose 
a final state / such that the decay amplitude A is dominated by one Feynman 
diagram, as in the case of / = ipKs■ In this type of CP violation B° o B° 
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oscillation obviously plays a central role. In fact, the CP asymmetry is caused by the 
interference of two distinct amplitudes, B°(B° ) —> f and B°(B°) -4 B°(B°) — > /, 
realized by the oscillation. (We may rewrite the CP violating factor Im ) 

as |>4(go 1 —>/)[ 2 I m (p A(2?° -» /) A(B° -»• /)*).) Thus we need a non-vanishing Am 
for the oscillation, and the observed value of x ~ 0.7 is in an ideal range, 
e. Although the CP asymmetry of this type is expected to be quite large, this 
in turn means the lengths of the three sizes of the relevant unitarity triangle are 
tiny, 0(A\ 3 ). Namely the branching ratios of the decay processes to observe the 
CP asymmetries are rather small, e.g. Br(B -4 tpKs) = 0( 10 -4 ). Thus, even 
though the CP asymmetries are sizable, we still need high luminosity for the collision 
experiments. Let us note that in the K system, Ks —> 7T7t was the main decay mode. 
Of course, the large CP asymmetries will help a lot in the statistical significance of 
the signal: needed luminosity is proportional to 1/A 2 , instead of 1/A, with A being 
a generic CP asymmetry. 

We are now ready to discuss the sizable CP asymmetry caused by the JB° -H- B° 
oscillation. For instance in e + e” collision experiments, e + e _ -» B°B°, B + B~, 
we a priori do not know which one of two neutral mesons is B°, which makes the 
observation of CP asymmetry difficult. We start with the idealized situation, where 
we know that at time 0 B° or B° was produced, and denote their state at time t 
as B°(t) or B°(t) with B°(0) — B° or B°(0) = B°, which are linear combinations 
of B° and B°, as shown in (9.46) and (9.47). Let us consider a CP asymmetry, 
which is the difference of the probabilities, or event rates, for the state starting as 
B° or B° to decay into the final state / or / at time t, divided by the sum of the 
probabilities: 


A(t) = P{B°(t) -> /) - P(B°(f) ->• /) 

where 


(9.78) 


P(B°(t) -> /) oc \g+(t)A(B° -4 /) — ^g-(t)A(B° -4 /)| 2 , (9.79) 

P{B°(t) —>• /) <x \ g+ (t)A(B° ^ f) - V(t)A(S°^/)| 2 . (9.80) 

As we have mentioned above, we can safely assume ||| = 1. We also take a CP 
self-conjugate state as the final state, / = ±/, which has a merit to allow the both 
processes B° —► / and B° —> / possible, and to eliminate the uncertainty in the 
phases due to the strong interaction, “strong phases”, in the ratio : As 

the strong interaction preserves CP, generally the strong phases of A(B° -» /) and 
A(B° —> /) are exactly the same, which means for the case of / = / the ratio is 
free from the strong phase. The typical examples are f = ipKs, ttk. Under these 
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assumption the time-dependent asymmetry A(t ) may be written as 

cos(Amt)[r(B° -> /) - T(B° -4 /)] + 2sin(Amf)r(B° -4 f)lm (£$g^{) 

(r(B° -4 /) + r(s° -4 /)) 

(9.81) 

where, the partial decay widths are proportional to the squared-transition- ampli¬ 
tudes: 


r(J5° -4 /) cx |A(B° -4 /)| 2 , r (BO _> /) « |yL(£° /)|3. (9.82) 


In the expression for the time-dependent CP asymmetry A(t), the term accompanied 
by cos(A mt) does not go away at t = 0, and is the indication of the direct CP 
violation in the A B = 1 decay amplitude, which does not need the help of the 
flavor oscillation, while the term accompanied by sin(Amf) and Im (p^ fol^ )) 
vanishes at t — 0, and indicates the CP violation due to the interference between 
A B = 2 and A B = 1 amplitudes, with large CP asymmetry, we are seeking. As 
we have discussed above, we are interested in the process where the decay into / 
is dominated by one Feynman diagram. In that case the direct CP violation is 
negligible, i.e. T(B° —4 /) ~ r(f?° —4 /). In this way, the formula for the CP 
asymmetry greatly simplified into 

Mt) = sin(Amf) ■ Im (i^|^±il). (9.83) 


It seems the time-integrated or time-averaged CP asymmetry vanishes, as the time- 
average of sin(Amt) just goes away. This is the case provided T = 0, as in the 
CP asymmetries in neutrino oscillations, The finite lifetime of B mesons, however, 
makes the smearing of the CP asymmetry due to the oscillation incomplete. From 
a relation, 


r °° r°° _ 

( / e -rt sin (Amt) dt)/( / e~ rt dt) = --- 

Jo Jo 1 + x 

we get the time-integrated asymmetry, 

f 0 °° dt [P(B°(t) -4 /) - P(B°(t) -4 /)] 
- / 0 °° dt (P(B°(t) -4 /) + P(B°(t) -A /)] 

X g A(B°-4/) 

~ l + x 2 >A(£°-4/) ; ’ 


(9.84) 


(9.85) 


where the smearing factor ~ 0.5 is fortunately not small. Write the time- 

independent CP asymmetry factor as a(f) = Im (pj^foE^})- As the three sizes 
of the unitarity triangle are of comparable lengths, and the coefficient function 
representing the contributions of = u,c,t to A B = 2 box diagram grows roughly 
as ~ mf /M'w the My 2 , r ]2 stemming form the box diagram are dominated by the 

t quark contributions. Thus £ ~ is written in terms of KM matrix elements 
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related to t quark, (V t3 V t * d ) 2 . Thus for the most promising channel to observe the 
CP asymmetry in the B-factory experiments, 


a(TrKs) — Im( 


{VtoVu? 

\Vtt,V t * d \* 


) 


—sin 20i, 


(9.86) 


where = e*^ 1 ~' r * was used (see Fig. 9.13(a)). As we expected, the CP 

asymmetry ts described by the sin of an inner angle of the unitarity triangle in 
(b,d) sector. In terms of Wolfenstein’s parameters p and r; of 0(1) (see (4.184)), 

(987) 


and the |a(7r.?Cs)| can be of order 10%. This large CP asymmetry has already been 
observed at B-factories: Belle and BaBar experiments have reported |a(7rATs)| ~ 


70%. 


One may wonder why the expression above is not re-phasing invariant: the phase 
of {VtsV t ' d ? changes by the re-phasing. Actually when the factor — 

is taken into account, we realize that the whole ex- 
pression is re-phasing invariant, as it should be. Let us note that V* b V C8 comes from 
the decay b —» ccs due to the ordinary charged current process, while p* K / q* K ~ 
( V* s V c d)/{ V cs V* d ) reflects that K° and K° are contained in Ks by the fractions px 
and qx- As the matter of fact, however, V* b V cs is almost real and the CP violating 
phase appearing in p* K /q* K — 1 + 2e* is suppressed by |e| = (9(10 -3 ). That is why 


the CP asymmetry is dominated by the phase of 

So far we have assumed that we can prepare a neutral B meson with definite 
flavor, i.e. B° or B°, at t = 0. Such identification is crucial when we discuss CP 
asymmetry, since the interchange B° B° at t = 0 will change the sign of CP 
asymmetry. Unfortunately such identification of flavor is not possible in a realistic 
situation of e + e~ experiment, such as the B-factory experiments Belle and BaBar, 
since B° and B° are produced in a pair. In addition the presence of B° f-> B° 
oscillation makes the identification difficult. One intriguing idea is to tag the flavor 
of one of the mesons to be, e.g. B°, by the observation of its decay products 
at some time t. As B mesons are produced at the collider experiment through 
e + e“ -4 Y(4S) -> B°B°, the relative orbital angular momentum l of two mesons 
is 1, i.e. p-wave, whose wave function is antisymmetric under the exchange of two 
mesons. Thus the remaining meson at time t should have opposite flavor, e.g. B°, 
even though two mesons may have the same flavor at different times as the result 
of the oscillation. If we regard the time of the tagging t as a sort of initial time we 
are able to repeat the argument extended above for the idealized situation. 

Let us illustrate the idea more concretely by explicit calculations. At the initial 
time when B mesons are produced via the decay of T, the state of two mesons with 
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momenta k and k' is described by 

~=[\B°(k))\B°(k')) + c\B°(k'))\B°(k))}, (9.88) 

where c = (—1)*. Though in the on-going B-factory experiment 1 = 1, c = — 1 as 
explained, c = 1 is possible, e.g. in the process Y(55) -A B°B°* -A B°B°y, where 
l = 0 is possible. Let f a a final state, which both of B° and B° can decay into, 
i.e. f a = f a , and fb a final state, which only B° can decay into. Namely fb can 
be utilized for the tagging of B°. Let P(/ 0 , fb', t, t') c dtdt' be the probability for the 
mesons with momenta k and k' to decay into f a and fb during the time intervals 
(t,t + dt) and (t‘, t' + dt'), respectively. Then, 

P(fa,fb;t,t') T 

|i[< X f b \B°(t') >+c< fb\B°(t') >< fb\B°(t) >]| 2 

T(B° -A fa)T(B° -A fb)e~ rt + 

{! - Im (p ^ ' ( ~ fo l| y°j ) sin(A mt^)}, (9.89) 

t ± t' ( t + > 0, — 1 + < < f+). Similarly, 

P{faJb',t,t % 

oc r(J9° -> f a )T{B° -a fb)e~ rt+ 

■ {l + Im (j ^go sin(Amt T )}, (9.90) 

where a good approximations r(P° -A f a ) ~ r(B° -> /„), and T(B° -A fb) ~ 
T(P° -A fb) have been used, which are valid for decay processes dominated by tree 
level diagrams. The subscript ^ correspond to c = =f 1. At the time t' of tagging of 
the meson with momentum k', another meson is known to have different flavor, and 
if we regard f_ as the time t in the ideal situation it is expected that we have a CP 
asymmetry with the same time dependence as what we got above for the idealizes 
case. In fact, for the case of our main interest c = —1, after the integration over t + 

[ P(fa,fb',t,t')~ dt+ oc e~ r|t-l {l — Im (- ^[p 0 ^ fo ) sin(Amf-)}, (9.91) 

y |t_ | pA{D u Ja) 

[ P{faJb;t,t')- dt+ oce _r|t - | {l + Im (~ ^/o 0 ZI /"! ) sin(Amf_)},. (9.92) 

y |t_ | pA(tr 11 fa) 

As we expected if we replace t- by t we recover the previous results. One remarkable 
difference from the previous case is that now t- is not positive-definite, but may be 
negative ( t < t'). Thus if we further integrate the probabilities over f_, the term 
proportional to the CP asymmetry Im (| ^(goZ^° ' ) ) will disappear. Thus we need 
to identify the decay times t and t'. This is why the on-going B-factory experiments 


oc 

oc 

where t± = 
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are performed by asymmetric collider where the energies of e + and e~ are different 
and therefore the center of mass system is moving. Thus the difference of t and t' is 
measurable by identifying the corresponding decay points. If we denote the number 
of events for the two mesons to decay into f a and /(, at time t and t' with t > t', 
etc. by N(f a , fb)t>t' etc., the time-integrated CP asymmetry A = j—j a (fa) is 
given as 


^ _ N(f a , fb)t>t' - N(f a , fb)t>t' - N(f a ,fb)t<t' + N(f a , fb)t<t’ 
N(fa, fb)t>t' + N(f a , fb)t>t’ + N(f a , fb)t<t' + N(fa, fb)t<t‘ 


(9.93) 


Problems 

9.1 Consider the Yukawa couplings of two Higgs doublets <f>i, fa, given in (9.3). 
Show that a linear combination of their neutral components 

~ 7 2 "i ( Vl Vi + ^ 2 ^ 2 ) ( v i 2 = (^ 1 , 2 )) has flavor-diagonal Yukawa coupling (no 
FCNC). 

9.2 Utilize the effective FCNC 7 -vertex given in (9.21) and (9.23), (9.24) to get 
the effective FCNC gluon-vertex, i.e. the contribution of “penguin” diagram. The 
comment below (9.24), concerning the Q-dependence of the effective 7 -vertex may 
be helpful. 

9.3 Solve the time-evolution equation (9.42) to get the solution (9.44), (9.45). 

9.4 Prove that when m u = m c , for instance, all the elements of the KM (Kobayashi- 
Maskawa) matrix can be made real, by use of suitable re-phasing of quark fields. 

9.5 Verify that the condition (9.56) is equivalent to (9.57). 
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Appendix A 

Notation and Useful Relations 


A.l Four vectors 

Throughout this book, we use the natural unit h(= h/2n) = c = 1, where h is the 
Planck constant and c is the light velocity. We follow the convention of Bjorken 
and Drell (Bjorken and Drell, 1964). 

Space-time coordinates x = (t, x, y, z) = (t, x) are described by the contravariant 
4-vector 


x^ = (x°, x 1 , x 2 , x 3 ) = ( t,x,y,z) . 
The metric tensor is defined as 


T = 9»v 


{ 1 


V 0 


-1 


-1 


0 \ 


-1 / 


and produces the covariant vector 

*ii = 9 ^x v = (xq,xi,X 2 ,x^) = ( t,-x,-y,-z ) 


(A.l) 


(A-2) 


(A.3) 


The repeated indices are summed unless otherwise specified. Then the scalar prod¬ 
uct of space-time points is given as 

x 2 = x^Xf,, = g lil/ x ,i x 1 ' = g^x^x,, = t 2 - x 2 . (A.4) 

The four-momenta are similarly given as 

P^ = (P 0 ,P\P 2 ,P 3 ) = ( E,Px,Py,Pz ) ■ (A.5) 

The four-gradient is defined as = d/dx 1 * or = d/dx^. Then the momentaum 
operator in the coordinate representation is given as 

p"=z^= . (A.6) 
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Notation and Useful Relations 


Thus, 


P"P/* = -d^d^ = + V 2 = -□ , 


(A.7) 


where V 2 and □ are called the Laplacian and the d’Alembertian operator, respec¬ 
tively. 


A.2 7 matrices 

The Dirac 4x4 7 -matrices satisfy the following anticommutation relations, 

{ 7 ^, 7 "} = 2 gTI 

where I is the 4x4 unit matrix. (A. 8 ) leads to the following relations: 

= 4 , 

l^lvln = - 27 „ , 

'flvlpln = ±g vp , 

l^lvlplain = —^lalplv ■ 

Define the chirality operator 75 as 


which satisfies 


and 


7 5 = 75 = i7°7 1 7 2 7 3 


7s 2 = 1 , 


{75,7 M } = 0 . 


(A. 8 ) 

(A.9) 
(A. 10 ) 
(A.ll) 
(A.12) 

(A.13) 

(A.14) 
(A.15) 


By using the Levi-Civita tensor which is totally antisymmetric, one can also write 
75 as 


75 = p. V pal' 1 l V l P l a 


where 


+1 for even permutations of 0 , 1 , 2 ,3, 
— ^ f or permutations of 0,1,2,3, 

0 otherwise. 

The Levi-Civita tensor satisfies the useful relations: 

x^txnxv xnx^\ 1 x* 


(A.16) 


(A.17) 


e aXtlv e, 


apar = -^(<W - «) + 8^5" - 6*6 v p ) 

-6${6W-6$6;), 


e a ^e a0pa = -2 m-SW), 


(A.18) 

(A.19) 
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£ a ^e a ^ p = -68$ 

£ a0lS £ a 0yi = -24 , 


where 6% is the Kronecker’s delta, 


St 1 , = 



H = v 
fi v . 


The scalar product of a 7 -matrix and a 4-vector is given by 

7 M a M = f i = 7°a° -7-0 


and similarly 

7 ^ — *7 ^ = i<j? = i~f 0 dt + *7 • V . 


There are two kinds of useful representation of 7 -matrices. 
( 1 ) 7 °-diagonal representation: 


7 


0 


I 0 
0 -I 


7 = 




( 2 ) 75 -diagonal representation: 



where I is the 2x2 unit matrix and 5 is the 2x2 Pauli matrix 


(7 x — 


0 

1 





The spin tensor is defined as 


= ^(7Y-7Y) = ^[^7l, 


which, in the 7 °-digonal representation, leads to 

o‘ ) and n" 
(i,j,k = 1,2,3) 



where e l i k is the totally antisymmetric tensor with e 123 = 1 . 


(A.20) 
(A.21) 

(A.22) 

(A.23) 

(A.24) 

(A.25) 

(A.26) 

(A.27) 

(A.28) 

(A.29) 
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Notation and Useful Relations 


A.3 Trace theorems 


In calculating the transition matrix elements, the following trace theorems are use¬ 
ful: 


Tr I 

= 4, 

(A.30) 

Tt7 5 

= 0, 

(A.31) 

Tr(odd number of 7 's) 

= 0, 

(A.32) 

Tt(7m7^) 

a. 

Oi 

I! 

(A.33) 

Tr(7 Ai 7^7p7<r) 

— 4 {9po9p<r 9pp9ua T 9pcr9vp) ) 

(A.34) 

Tt(757m) 

= 0, 

(A.35) 

Tt( 757 m 7 v) 

= 0, 

(A.36) 

Tt(757 m 7^7p) 

= 0, 

(A.37) 

Tr(757 M 7^7p7<T) 

— 4i 

(A.38) 


The following relations are also useful: 

Tr(7 M 7*i 7*7*2) = 4 [p?p£ + Pjp£ - g^ipx ■ P 2 )} (A.39) 


Tr [7^(1 - 75)^17 1/ (1 - 75)&] = 2Tr(7^ 1 7‘ , / 2 ) - Si£ tiva0 pi a p 2l3 (A.40) 


Tr(7 M A7‘ / &) Tr (7MM«^4) = 32 [(pi • p 3 )(p2 ■ P4) + (Pi • P4)(P2 ■ Pa)] (A.41) 


Tr(7'‘^ 1 7‘ / 75/5 2 )Tr(7 M ^37 v 75^4) = 32 [(pi • p 3 )(p2 • P4) - (pi • P4XP2 • P3)] (A.42) 


Tr [7^(1 - 75)7*17 < '(1 - 75)7*2] Tr [7^(1 - 75)7*37X1 - 75)7*4] = 256(pi • p 3 )(p 2 • P4)> 

(A.43) 


A.4 Right- and left-handed Dirac spinors 


The Dirac equation for a free fermion with mass m is given by 

(*7 M cX — x) = 0 , (A.44) 

where ip(x) represents the 4 component spinor. It is convenient to define chiral 
spinors (left- and right-handed spinors) as 


ip L = 

= Pl 

2 75 

i’R = 

£ 

II 

- 5 >- 

£ 

1 « 
i-H 

(^ S 1 2^ 


(A.45) 
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where Pi and Pr are projection operators of left-handed and right-handed spinors, 
respectively, and satisfy 

Pl + Pr = 1, Pl=PL, P' r = Pr , 

Pi = P L , Pr — P R , P L P R = PrP l = 0 . (A.46) 

Note that 


lb*R = ipR and 75 rpi = -*l ■ 


(A.47) 


Thus, ipL and i/jr are eigenvectors of the chirality operator 75 . Define the Dirac 
adjoint spinor 1 ]> = 7 /^ 7 °. Then we have the following important identities: 


*Pr = *L , *Pl = *R > 

(A.48) 

fa* 1 * = + itRl^lpR , 

(A.49) 

*^15* = ^Rl^tpR - , 

(A.50) 

= *R*L + *l4>R ■ 

(A.51) 


A. 5 Dirac equation in the 75-diagonal representation: 

When we write the 4-component Dirac spinor in terms of 2-component Weyl spinors 
X and 7 j as 

(A.52) 

we have the relations in the 75 -diagonal representation 

Pr* = *r = ( 0 ) > PL* = *L=(° rj y (A.53) 

where x and 77 are unrelated 2-component spinors. In the 75 -diagonal representation, 
the Dirac equation is written as 

v )(;)-»(;)=»• <**> 

which results in the following simultaneous equation 


(-i—- iff ■ V)x - mri = 0, 

(A.55a) 

(-i— +ia- V)r/-mx = 0 . 

(A.55b) 
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If m = 0, we have two independent Weyl equations, 



4*= ~ is - Vx ' 


(A.56a) 


. d 

l dt 1 = lcr ' Vr >- 


(A.56b) 

Assume the plane-wave, 

X(x) - x(p) e ~ lpx , 

T)(x) = fj(p)e~ tpx . 


(A.57a) 
(A.57b) 

Then we have 

Ex = +a-px ■ 

Ep = —a ■ pp , 


(A.58a) 
(A.58b) 

For positive energy solutions with E = |p| > 0, we have • 




a ■ p 

nr = + *' 


(A.59a) 


S • p 

-w = 


(A.59b) 

Defining the helicity operator by 




s-p lap 

\Pl “2 bV 


(A.60) 

Then we can write 

. 1 
hx = + 2 % , 


(A.61a) 


hr) = -Ip . 


(A.61b) 

that is, helicity of x and i) is +| and — |, respectively. 

If we use the 4-component notation for spin as s = ^ ^^ 

° 1 
a/2 J 

| , we have 



where p = ^. Therefore, helicity h is the same as chirality 75 for massless fermions 
with positive energy. For negative-energy fermions(which correspond to antiparti¬ 
cles), helicity h has the opposite sign to chirality 75 . If a fermion is massive (m ^ 0), 
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X and 77 are still eigenstates of chirality, i.e. 


and 75 


(A.63) 


but they are no longer eigenstates of helicity h because of the non-zero mass term 
in the Dirac equation. 


<7 px - +Ex + rnrj . 
a ■ prj = —Efj - mx , 


(A.64a) 
(A.64b) 


But for high energy particles with E m, we can approximate ^ and 

then we have 


1 m 

h X = + 2 *+ 2 e”' 

. 1 m 

= “ 2 ,_ 2 £ X ' 


(A.65a) 
(A.65b) 


By using the eigenstates of h, x' and 77 ', as defined as hx' = + |x' and hr] 1 = —\r)', 
respectively, one can write them as 


/ m 1 

X +^V , 
, m , 

~E X ' 


(A. 66 a) 
(A. 66 b) 


That is, x and 77 are given by the mixture of both helicity states where contribution 
of wrong sign helicity is order of m/E. 
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Appendix B 


Cross sections and Feynman rule 


B.l Cross sections 

The scattering cross section or the transition probability for particle reactions is 
described by the S matrix as 

S = l+iT (B.l) 

where T, being called the transition matrix, denotes the transition between initial 
i and final / states. 

First, consider a 2-body collision, a(p a ) + b(p b ) —> c(p c ) 4- d(pd), as an example, 
where the momentum of each particle is given in parentheses. The transition matrix 
element for this process can be written as 

T fi = (2tt) 4 <5 4 (p a +Pb~ p c - p d )N a N b N c N d M fi . (B.2) 

For the time being, we consider the case of spinless particles. Extention to more 
general case including particles with spins is straightforward as described later. In 
(B.2), all dynamics for this transition are included in Mfi which is Lorentz invariant 
and called the invariant amplitude. TV is a normalization factor which depends on 
the convention of the wave function normalization. Here we take the convention 
for normalizing 2 E particles in a volume V, i.e. N = by defining the free field 
wave function as 

cfi(x) = (B.3) 

which leads to f pdV = 2 E with p — i ((j>*do<j) — do </>*</>) = Substituting N = 
-j= for all particles involved into (B.2), we can write 

T fi = (27r) 4 5 4 (p a + pt -p c -pd)^M fi> (B.4) 

The transition probability per unit time and unit volume for this collision is 
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Cross sections and Feynman rule 


given by 


W f i = M (B.5) 

where T is the time interval of the interaction. By using the relation 

[S\Q)] 2 6\Q)j^ I d 4 xe iQx (Q = 0) = (B.6) 

Wfi can be written as 

Wfi = ^^-6 4 {p a + p b - p c - p d )\M fi \ 2 . (B.7) 


Prom (B.7), we can obtain the cross section a by multipling it by the number of 
available final states and dividing it by the incident flux F. As is well known, the 
number of states in which a particle in a volume V has its momentum between 
p = ( Px,P y ,Pz ) and p + dp = (p x + dp x ,p y + dp y ,p z + dp z ) is . Since we are 

normalizing 2 E particles in V, the number of states per one particle is 

Then, the number of available final states for the process a + 6 —► c + d is — 1 - — p - 
Vd 3 y d 


(2ir)®2 E c 

( 2 n)$ 2 E d ■ The incident flux F can be calculated as follows; if we consider the process 
in the laboratory frame, the number of the incident particle a passing through unit 

area per unit time in this collision is ^y^\vi n \, where V{ n is the velocity of the 

2 Eu 


incident particle a, while the number of the target particle b per unit volume is v 
Thus, the incident flux for the collision of a and b is F = \vi n \ 2 (t 6, ■ Then we ca 

obtain the cross section for this process as follows; 


da(a + b —> c + d) 


1 


\v in \4E a E b (27r) 2 
d 3 p c d 3 p d 


< 5 4 (Pa +Pb-Pc - Pd) 


2 E c 2 E d 


\M(a + b^ c + d)| 2 


(B.8) 


The normalization volume V disappeared from (B.8). This is natural because V 
is not a physical parameter. Therefore, hereafter we set V = 1 which corresponds 
to take the wave function normalization as N = 1. This is the reason why we can 
assign 1 to the external line of the spinless particle in Feynman diagrams as shown 
below. 

In the general collinear collision of a and b, (incoming velocity of a) in the 
incident flux F = \vi n \AE a E b in the laboratory frame is replaced by v re i (relative 
velocity of a and b). F can be rewritten into the Lorentz invariant form as 

F — \AE a E b — 4|w a v b \E a E b 

= 4(|p a |p; a + \p b \E b ) = 4 \J{p a -Pb) 2 ~ m 2 a m 2 b ,. (B.9) 
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Now, the extension of this formula to the n particle production processes, a+b -4 

1 + 2 H-+ n, is straightforward, 


da (a 4- b — ^ 1 -f- 2 4 • • ■ + ti) 

1 


A H -\o-o-2 /'9'7r\(3n—4) ^ Pb ^ ^2 

4 \/(p a • p &) 2 - »n 2 m£ ( 27 r)i Jn 4 1 
d 3 Pi d 3 p 2 d 3 p n 


2E\ 2 E 2 2E n 


\M(a + 6 —^ 1 + 2-h n)| . 


~Pn) 

(B.10) 


One can easily generalize these formulas to processes including particles with spin 
by replacing |M/i| 2 in (B.8) and (B.10) by Yhs pin |M/»I 2 > which means that the spin 
degree of freedom must be summed for the final states and averaged for the initial 
states if the spin of initial particles is not polarized. In addition, for spin | fermions 
the normalization convention for the wave function, 


u^ s \p)u^(p) = 2m, v^(p)v^ s \p) = —2m, (B.ll) 

and also the energy projection operator, 

a+(p) = u(s) (p)^ ( s) (p) = i> + 

S 

A _(p) = — = — i> + m, (B.12) 


must be used in these cross section formulas. 

When we write the cross section formula (B.10) as 


da (a + 6->l + 2 + -- -+n) = ^^-dR {n \ 

F 

dRW is called the Lorentz invariant phase space which is given by 


dR M = (27T ) 4 6 4 (p a +Pb~Pl~P2 - Pn) 

d?pi d 3 p2 d 3 p n 

(2ir) 3 2E 1 (2n) 3 2E 2 "' (2-K) 3 2E n 


(B.13) 


(B.14) 


B.2 Decay width 


The decay width of a particle can be calculated similarly; the decay width formula 
for a particle with mass M decaying into n particles in its rest frame is given by 


dT(M 


x 


1 + 2 4- • ■ • + n) 

2^ (27 T )ln~A) \ M fi\ 26 ^P a +P„-Pl-P2 - Pn) 

d 3 pi d 3 p 2 d 3 p n 
2Ei 2E 2 2E n 
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Cross sections and Feynman rule 


B.3 Feynman rule 

The invariant amplitude Mji for an specific process is given by the sum of Feynman 
diagrams corresponding to its process. They are usually composed of three kinds 
of factors, i.e. external lines, propagators (internal lines) and vertices, which are 
described below. 

Feynman rule for QED 

(1) External lines 

• spin=0 bosons: For both incoming and outgoing bosons, assign 1. 

• spin=| fermions: For an incoming fermion or anti-fermion, assign u^(p) or 
v^(p), respectively. For an outgoing fermion or anti-fermion, assign > (p') or 

)(p'), respectively. 

• spin=l photon: For an incoming photon, assign e^\k). For an outgoing photon, 
assign e*J- X ^ ( k ') 

(2) Propagators 

• Boson with spin=0: 


p 2 — m 2 


• Fermion with spin=|: 


i _ . + m 

$ — m p 2 — m 2 


• Photon with spin=l: 


^wwwv =i(-«r + ( i-o^) 

where £ is the gauge parameter. In the Feynman gauge, £ = 1. 

(3)Vertices 

• spin=0 boson-photon vertex: 
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/ 

•x •** 

S / 

-ie(p + p% 

(for charge +e) 




Feynman rule for QCD 

(l)External lines 

• spin=| quarks: For an incoming quark or anti-quark, assign u^(p) or v (s) (p), 
respectively. For an outgoing quark or anti-quark, assign u^ s '^(p') or v(°‘)(p'), re¬ 
spectively. Though quarks have the flavor and color degree of freedom, it is not 
written explicitly in these spinors. 


• spin=l gluon: For an incoming gluon, assign ej^(fc)y. For an outgoing gluon, 

assign e*J~ X ^ Here (i = 1,2, • • •, 8) represent the color degree of freedom. 

(2)Propagators 

• Quark with spin=|: 


i 

p — m 


. 15 + 771 

l ~2 -2 

p z — m* 


• Gluon with spin=l: 
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omrrimrrcTr^ = ^ + 

where £ is the gauge parameter. In the Feynman gauge, £ = 1. 
(3) Vertices 

• Quark-gluon vertex: 




9sfijk\j9l ill'll ^2 )A 4" Qv a (^2 ^3 )/( 

+5A M (fc3 - *l)«,] 



~^‘9s[fijmfklm{9p\9i'P ~ 9pp9v\) 
4 " fitmj'jkmi,9pv9Xp ~ 9p\9vp) 
4 " fikmftjm(9pp9v\ ~ 9pv9\p)\ 


Feynman rule for the GWS model of electroweak interactions 
(l)External lines 

• spin=0 Higgs bosons: For both incoming and outgoing Higgs bosons, assign 1. 

• spin=| leptons and quarks: For an incoming fermion or anti-fermion, assign 
u W (p) or v^(p), respectively. For an outgoing fermion or anti-fermion, assign 
u^ s '\p') or v^ s '^(p'), respectively. Electroweak interactions are not affected by color 
charges. 
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• spin=l gauge bosons: For an incoming gauge boson, assign 

going gauge boson, assign e*J- X \k') 

(2) Propagators 

• Higgs boson with spin=0: 


i 

P 2 ~ P 2 


• Lepton and quark with spin=l; 


i 

f-m 


. j> + m 

1 p 2 — m 2 


• Gauge boson with spin=l: 


fc 2 -M 2 


(" 




+ 


(3)Vertices 

• Charged current vertex: 



. For an out- 


*'***'\ 
M\ ) 


1-75 

2 






264 


Cross sections and Feynman rule 



• Neutral current vertex: 



where 

a{ = — ^ + sin 2 Qw, a f R = sin 2 6w, for / = e~,n~,r~ 
£ 

12 2 

a[ == - --sin 2 0vv, a f R = - - sin 2 0 W , for/ = tt,c,f 

a[ - -- + ^sin 2 ^, a£ = isin 2 0w, for f = d,s,b. 

and for massless neutrinos, a‘ L — and a R — 0. 

• Gauge boson vertex: 

1. Three point interaction vertex: 

7 W + W~ vertex: 



ie[g v \(ki - fc 2 ) M + 9\»(k 2 - k y ) v 

+9nv(k-y - fci) A ] 


Z°W+W~ vertex: 
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W + , 



W~,k 2 , A 


iecot9 w [gv\{ki - fc 2 ) M + 9\n ( k 2 - fo)./ 

+^(*3 - *i)a] 


2. Four point interaction vertex: 



— ie* {%Qaf}Qiiu 9<*u90 v 9ocu90fi) 



id} COt 6\\r {^9 oi09ij.v 9ay.90v 9ati/90fi) 



ie^ COt^ @w(29a09ni' 9ocfi90v 9av90{i) 



• Higgs boson ( H ) vertex: 
1. HW + W~ vertex: 
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W+,k u u 


Cross sections and Feynman rule 



W + ,k 2 ,X 


ie 

sin Ow 


Mw9v\ 


2. HZ°Z° vertex: 



2ie 

sin 26 w 


Mzgv\ 


3. Fermion-Higgs Yukawa coupling vertex: 


ie rrif 
2 sin 9\y M\y 

I 

k 



4. Higgs boson self-coupling vertex: 





I 

k 


3 fj?e 


2M\y COS 6\v 


5. Four point interaction vertex including Higgs boson: 
HHW+W- coupling: 
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HHZ°Z° coupling: 


2 ie 2 



HHHH coupling: 




JT 




'H 


3 ifi 2 e 2 
sin 2 6w 
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Appendix C 

Basics of the group theory 


The Lagrangian is the fundamental object describing the dynamics of the physical 
system and is to be constructed so that the interactions are invariant under certain 
symmetry transformations. The requirement of invariance under symmetry trans¬ 
formations is a guiding principle for the construction of interacting field theories. 
Here we give a little mathematical basics of the symmetry, i.e. the group theory, in 
particular, an SU(n) group and its algebra. 


C.l Group and representation 

A group G is defined as a set of elements {a, b, c, ■ ■ •} with the follwing conditions, 

(1) If a and b are in G, a ■ b is also in G. 

(2) There is an associativity law, a ■ (b ■ c) = (a • b) ■ c, for any elements in G. 

(3) There exists a unit element e in G, which satisfies a ■ e = e ■ a = a for any 
a in G. 

(4) There exists an inverse element a -1 in G for any a in G, such that o-a -1 = 
a -1 • a = e. 

Among many examples of groups, the n dimensional orthogonal group (O(n)) and 
special unitary group ( SU(n )) are interesting in particle physics. O(n) is the one in 

which the length r 2 = x\+x \-1-b x 2 in the n dimensional real space is invariant 

under any rotation in this real space. U(n) is the one in which the length defined 

by s 2 = |xi| 2 + |x 2 | 2 +-1- |x n | 2 in the n dimensional complex space is invariant 

under any rotation in this complex space and in addition, when the determinant of 
the transformation matrix U is 1, i.e. det U = 1, it is called the SU(n) group. 

In case of a ■ b = b ■ a for any elements a, b in G, the group is called the Abelian 
or commutative group, while in case of a ■ b b ■ a, it is called the non-Abelian or 
non-commutative group. One-dimensional translation, two-dimensional rotation 
(0(2)) or phase transformation (17(1)) etc. belong to the Abelian group, while 
three-dimensional rotation (0(3)) or special unitary transformation ( SU(n ) with 
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Basics of the group theory 


n> 2) etc. belong to the non-Abelian group. Specially interesting group in particle 
physics is the Lie group in which the group element is the analytic function of the 
continuous parameter. 

Let us assume that there is a matrix or operator M(a) for any elements a in G. 
Then, if M(a) satisfys the conditions, M(a)M(b) = M(ab), M(a~ 1 ) = M -1 (a) and 
M( 1) = 1, then M(a) is called representation of the group G. 


C.2 SU(n) group and Lie algebra 


We are interested in the SU ( n ) group defined in the n dimensional complex space, 
which is expressed by an n x n unitary matrix with its determinant being equal to 

1 


C/ + = C/- 1 , det U = 1. 


(C.l) 


Then, how many parametrs are there in [7? As is well known, an n x n complex ma¬ 
trix has 2n 2 real parameters. But there are n 2 constraints among these parameters 
because of the unitarity relation WU = 1 and moreover, there is one additional 
constraint due to the condition of det U = 1. Therefore, the number of the real 
parameters of the matrix U is 2n 2 — (n 2 + 1) = n 2 — 1. For example, the unitary 
matrix for SU(2) and SU (3) has 3 and 8 parameters, respectively. Using these 
parameters 6 l (i = 1,2, • • •,n 2 — 1), the nxn unitary matrix U for SU (n) is written 
by 

17(0) = e~ i(>iLi = e~ ilt (C.2) 


where L l are the nxn matrices called the generators for the SU(n) group. Here 
the summation over i is implied. Since U is unitary, L l are hermitian matrices. In 
addition, L x is traceless due to det U = 1. Explicit expressions of L l , for example, 
for SU(2) are given by the 2x2 Pauli spin matrices (i = 1,2,3) or, equivalently, 
2x2 isospin matrices y (i = 1,2,3) with 


r 


l 


0 

1 




(C.3) 


and for SU( 3), they are given by the 3x3 Gell-Mann matrices y {i = 1,2, • • •, 8) 
with 


A 1 

A 4 


/ 0 1 0 
10 0 
\ 0 0 0 

/ 0 0 1 
0 0 0 
V 1 0 0 


A 2 

A 5 


0 -i 0 \ /100\ 

i 0 0 , A 3 = 0 -1 0 , 

000/ V 0 0 0 / 

0 0 -i \ /000\ 

0 0 0 ] , A 6 = ( 0 0 1 1 > 

*00/ \0 1 0/ 
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Ic-ClL 



SU(n) group and Lie algebra 


271 


/ 0 0 0 \ / 1 0 0 \ 
0 0 -* ’ A8 = -R 0 1 0 

\ 0 i 0/ VJ \ 0 0 -2 / 


(C.4) 


As explicitly shown above, the SU(2) group has one diagonal matrix, and 
the SU( 3) group has two diagonal matrices, and The number of diagonal 
matrices is equal to the rank of the group. The SU (n) group has n — 1 diagonal 
generators and is a group of rank n — 1. 

From the group property of U, we can see that these generators satisfy the 
following algebra which is called the Lie algebra 

[L\V]=if ijk L k , (i, j, fc = 1,2, • • •, n 2 — 1) (C.5) 

where fij k , being antisymmetric constants for exchange of any pairs of i, j and 
k, are called the structure constants of the group. Here the summation over k is 
implied. This statement can be proved as follows; let us start with the foil wing 
relation for non-commuting operators A and B, 

e A ■ e B = e A+B+ ^ A ’ B } + . (C.6) 

Now let A and B to be given by A = ia l L l = ia-L and B = if} 1 L l = ifJ-L with n 2 — 1 
parameters a 1 and /5‘, respectively, where the summation over i is implied. Then, 
since U(a) = e A and U(/3) = e B are the elements of SU(n), U(a) ■ U(0) = e A ■ e B 
is also the element of SU(n) because of the group property. Hence, e A ■ e B must 
be also written by e c with C = ij ■ L. Therefore, the commutator [L l , V] must be 
linearly related to the sum of the generators as described in (C.5). The structure 
constants fij k are determined from the associativity law of the group. 

The Jacobi identity 

[L\ [V,L k ]\ + [V, [L k ,V]\ + [. L k , [L\IA]] = 0, (C.7) 

with the Lie algebra (C.5) leads to the relation 

f jklfilm fkilfjtm fijifkim = 0* (C.8) 

Then, if we identify the matrices T 1 as (T") = -ifij k , (C.8) results in 

[T*,T>]=if iJk T k f (C.9) 

which is just the same algebra as (C.5). That is to say, the generators being just 
equal to the structure constants of the group satisfy the Lie algebra. In other 
words, the structure constants themselves generate a representation of the Lie alge¬ 
bra. The representation generated by the structure constants is called the “adjoint 
representation”. The dimention of the adjoint representation is just the number of 
generators, which is n 2 — 1 for SU(n). 
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C.3 Representation in SU(n) group 


Let us consider a field <p composed of n complex fields tp a (a — 1,2, • • •, n) and write 
it by a column vector, 


( Vi \ 
<P2 


<j) = 


(C.10) 


\ Vn ) 

If (j) changes into <f>' under the SU (n) transformation as 

</> ->• (j)' = U4>, or <p a -*tp' a = U b <p b = UabVb, 


(C.ll) 


ip a is said to be in the “fundamental representation” of SU(n) and denoted by n. 
Now, let us consider an infinitesimal transformation 


U{6) =e~ ie t ~ 1-iO-L, for 9 « 1. 
Then, we can write the field transformation as 

<Pa Va - Pa ~ <5^a, 

with 

&<Pa =i(0- L) b a (fi b = ie b a (f b . 


(C.12) 

(C.13) 

(C.14) 


Next, let us introduce the conjugate fundamental representation n*. If we take 
the complex conjugate of (C.ll) and use the relation (U*) b a = {U*) ab — (U* r ) ba = 
(U^) ba = (U^)l, where W is the hermite conjugate of U, then one can obtain the 
transformation of the congujate states ip* as 

vl-*tf = mlvl = vl{u')l (C.15) 

Now, it is convenient to introduce the upper indices to these conjugate fields as 
ip a = ip*. Then we can rewrite (C.15) as 

<p a -> ^ = V b {U') a b , (C.16) 


The representation for those fields tp a is called the “conjugate fundamental repre¬ 
sentation” and denoted by n*. Since the infinitesimal transformation associated 
with U*{6) is U*(6) ~ 1 + iO l L l * = 1 — the conjugate representation 

ip a = <£>* are generated by — L l *. Therefore, all eigenvectors for generators L l of 
diagonal matrices are still eigenvectors of — L l * but of opposite sign. Regarding this 
property, the SU( 2) group is specially interesting because it has only one diagonal 
matrix Then for 51/(2), we can expect that the 2 and 2* representation is 
equivalent. In fact, this is the case as shown in the following. Let us look for a 
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matrix A so that A<f> transforms, like <f> in (C.ll), as A<f>* -4 (A(f>*)' = U(A(j>*) 
or <j> * -» <j>'* = ( A~ l UA)4>* = Thus, we must have A _1 UA = U*, that 

is, (i = 1,2,3) since U ~ 1 — *y for SU(2). This can be 

possible, when we choose A = ir 2 = f ^ 0 ) ’ J ^ c ' ,ua ^y’ we can see ^at 

^ = (_°i l ) (^ ) = (Jy) transtormslike (H )■*•»• 2 ‘ = 2 - 

Now, let us consider the direct product of n and n* representations. 

® X 6 = (<PaX b - + \^ cX C ■ (C.17) 

By taking the trace of (C.17), we find Tr(y> a <g> x b ) = VaX a - Hence the 1st term of 
the right-hand side of (C.17) is the traceless matrix with n 2 - 1 components, 

T b = <P«X b - y^cX c , (C.18) 

n 

and denoted by n 2 - 1. The 2nd term is just the trace term having only one 
component, 

s b = -S b <PcX c , (C.19) 

n 

and is denoted by 1. From (C.ll) and (C.16), we see that the transformation law 
of T and S under the SU (n) transformation is given as follows; 

T b -4 T b = U?< PmX n U} l b - ~6 b a Ur^nX n Ul c , (C.20) 

and 

s b -4 s' a b = U b a UTv m X n Ui c . (C.21) 

Using the unitarity condition U™U}f = 6™ where the summation over c is implied, 
these transformation law can be written in the matrix form as 

T -4 T = UTU\ (C.22) 

and 

5 -4 S' = S. (C.23) 

As described above, the matrix T b is an n x n traceless matrix with n 2 —1 components 
and thus it can be expanded by the n 2 —1 traceless generators L l (i — 1,2, • • •, n 2 —1) 
of the SU ( n ) group as 

T b = A\U)l (* = 1,2, • • • ,n 2 — 1) (C.24) 
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where A 1 are the parameters of the SU(n) group. Here the summation over i 
is implied. Now, let us consider the transformation law of A 1 under the SU(n) 
transformation. Under an infinitesimal transformation 

<Pa -*■ <p'a = <Pa ~ ie b a <Pb, <p a ->■ <p' a = ip a - i(p b e%, (C.25) 

the transformation law (C.22) leads to 

T^T' = T-i[e,T\, (e = a j L j , T = A k L k ) (C.26) 

which can be written as 

A i L i ^X i L i = A'L 1 - ia*[L*, L k ]A k 

= A i L i - iac’(ifji ti L i )A k . (C.27) 

That is, 

A'* = A* + f ijk a j A k , (C.28) 

or by identifying —ifjik = which is the adjoint representation, A 1 transform 

as 

SA* = f ijk a j A k = -i(-if jik )a>A k 

= -i(Ti) ik a j A k = -i(T-a) ik A k . (C.29) 

In these equations the summation over the same subscripts is implied. 

In summary, the T* which transform according to (C.22) or (C.26) and are given 
by (C.24), have n 2 - 1 components and are called the “adjoint representation” of 
the SU(n) group denoted by n 2 — 1, while S* is invariant under the SU(n) trans¬ 
formation, that is, it transforms as a singlet and denoted by 1. We can decompose 
the product of n and n* representation into n 2 — 1 and 1, 

n ® n* = n 2 — 1 © 1. (C.30) 
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C, P and T transformation 


In addition to the continuous transformations, there are several important discrete 
symmetries in physics. Here we concentrate our attention on space inversion (P), 
time inversion (T) and charge conjugation (C). In view of the field-theoretical model 
building, it is important to study whether or not the physical system is invariant 
under these transformations, and if not, how and in what degree these symmetries 
are violated. 


D.l Parity or space inversion P 


A parity inversion is the reflection of a spatial plane. This is equivalent to a space 
refelection followed by a rotation with an angle 7T about an axis perpendicular to 
that plane after the space reflection. Thus a parity operation is equivalent to a 
space inversion. A parity operation is defined as 

P : x —> x 1 = —x, t t' = t. (D.l) 


Invariance of a physical system under parity transformation means that any inter¬ 
actions in the system should be left-right symmetric. 

Here we just list up the transformation rule of fields under parity operation. For 
simplicity, we fix the phase factor to be t)p = 1. 


Scalar field 
Pseudoscalar field 

Dirac field 
Vector field 


<j){x, t) -> <f) F (x 1 ,t') = 4>(—x,t), (D.2) 

t](x, t) -> rf{$, t ') = -t](-x, t ), (D.3) 

( ip{x,t)-+ijj_ p (x',t')=joil>{-x,t), . . 

\ ip p (x',t') = Ip(-x,t)7 o, " ' , '' 

-t V» P (x = Vfj,(—x, t). (D.5) 


Furthermore, the bilinear covariants (S(scalar), P(pseudoscalar), V(vector), 
A(axial-vector), T(tensor)) composed of Dirac fields transform as 


S : ‘ip a (x,t)ip b (x,t) ->■ %j) a {-X,t)\j) b {-X,t), 


(D.6) 
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P : 

V> a (£,t)7 5 V>(,(£,i) -*■ -■0 a (-£,t)7 5 t/)i(-£,t), 

(D.7) 

V : 

i>a{S,t)'f%j) b (S,t) -t ‘4’a(-x,t)'y li 4i b (-x,t), 

(D. 8 ) 

A : 

tpa{.x,t)rf 5 ^^ b {x,t) -> —0 a (-£,t)7 5 7 M ^ 6 (-£,t), 

(D.9) 

T : 

rp a (x,t)a ll ''ip b (x,t) -»■ •ip a (-x,t)a tiv ip b (-x,t). 

(D.10) 


An important example of parity transformation is the one for the weak left-handed 
current defined by = ?/> 0 l(£, £) 7 m V>(,l(£, t), where the left- and right-handed 

fermion fields transform under parity inversion as 

f -» (V' P )z,,fl(5,i) = 70 il>R, L (-x,t), _ D 1 

1 $L,R{x,t) {ijj P )L,R(x,t) = tpR,L(-X,t)j 0 . ' 

Then, the weak left-handed current transforms as 

-t ’>I>aR(-X,t)'yo'ya10lpbR(-X,t) 

= 4>aR{-X,t)Yl!>bR{-X,t), (D.12) 

where we used the relation, 7 o 7 M 7 o = 7 j, = 7 M . 


D.2 Time inversion T 


A time inversion is a reflection of time as defined as 


T : £ -¥ x 1 = x , t -¥ t' = —t. 


(D.13) 


Here we just list up the transformation rule of fields under time inversion. For 
simplicity, we fix the phase to be t]t = 1 . 


Scalar field 
Pseudoscalar field 

Dirac field 
Vector field 


4>(x, t) —> <j> T {^,t') — <f> (£,—£), (D.14) 

■q(x, t) r] T {&, t') = -T](x,-t), (D.15) 

f ip(x,t)^il> T (x',t') = 'y 3 'y 1 ip(x,-t), . . 

1 *P(x, t) -+ $ T {Z,t') = i> T {x, -t) 7 1 7 3 , 1 ^ 

->• V^ix 1 ,?) = V„(x,-t). (D.17) 


One should not be confused at the superscript T to Dirac fields in (D.16); the 
superscript T on the left-hand side means the time-reversed field, while the one on 
the right-hand side does the transposed field. 

Furthermore, the bilinear covariants composed of Dirac fields transform as 


S : «/> 0 (£,t)^f,(£,f) ->• ipa{x, -t)xl> b (x, -t), (D.18) 

P : ^a(£, O7 5 V’t (£;*)-> -1A0 (£,-*) l 5 ipb(x,-t), (D.19) 

V : ^a(x,t)^ip b {x,t) -> daix^t^ipbix^t), (D.20) 

A : -> 4 >a(x, -t)7 5 7 M V'6(^, -t), (D.21) 
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T : ijj a (x,t)(T ,1 ‘ / il)b{x,t) -4 -tpaix, -t). (D.22) 


D.3 Charge conjugation C 


In addition to the space-time symmetries discussed above, there are other symme¬ 
tries called internal symmetries. Among many such internal symmetries, we are 
here interested in the charge conjugation. This is a discrete transformation of par¬ 
ticles into antiparticles, in which the electric charge and other quantum numbers of 
particles are reversed without changing any kinematic attributes. 

Here we just list up the transformation rule of fields under charge conjugation. 
For simplicity, we fix the phase to be rjc — 1. 


Scalar field 
Pseudoscalar field 

Dirac field 
Vector field 


<f>(x,t)<f> c (S',t') = $ (x,t), (D.23) 

t](x,t) -4 r) G (#,t') = 77 + (m,i), (D.24) 

J -4 i> c (x',t') = C$ T (x,t), . 

\ V>(x, t) -4 ip c (x\t') = -ip T (x,t)C~ 1 , 

-4 V'* o (f',0 = -V^{x,t), (D.26) 


where the charge conjugation operator C in (D.25) is explicitly written by C = i 7 2 7 ° 
and satisfies the relations, C _ 1 7 /J C = ~ 7 M and C = -C -1 = -C* = —C T . 
Furthermore, the bilinear covariants composed of Dirac fields transform as 


S : 

4> a (x, t)lp b (x,t) -4 ijj b (x,t)ip a {x,t), 

(D.27) 

P : 

■4>a{x,t)j 5 1p b (x,t) -4 Vi i ,(f,t) 7 5 V' 0 (f,t), 

(D.28) 

V : 

4> a (x,t)^lp b {x,t) -4 -\i) b {x,t)Y^a{x,t), 

(D.29) 

A : 

’4>a{x,t)^ b -t ,1 ij) b {x,t) -4 ^(,(x,t)7 5 7 M t/> a (f,t), 

(D.30) 

T : 

tp a (x,t)a fl, 'ip b {x,t) -4 -$ b (x,t)(r ,il 'ip a (x,t). 

(D.31) 


Now, a comment on the transformation of the left- and right-fields under charge 
conjugation is useful. Applying the transformation rule (D.25) to the left-handed 
Dirac spinor, we can obtain the following result, 


(V-l) c = CrPl = C 


(^L—■V') t 7° 


-IT 


= c 


ip 


t 1 -^ n 


2 

lT 


= Cl ol_ji r 


1 + 75 _ 1 + 75 n 7T 


-C 7 V* = 


cy 


2 2 

1 + 75 V ,C = W ,c )fl = V’h- 


(D.32) 


Similarly we obtain the result, ( iPr) c = = V’lj f° r the right-handed spinor. 

Furthermore, we can also obtain the similar result even for these adjoint spinors. 
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We summarized them as 

f Vr.fl “>• ('<P C )l,r = (iPr,l) c = CrPl L , 

1 (V’ C )L,fl = (V> R,l)° = Vr, L C- 


(D.33) 


D.4 CP transformation 


We are here interested in the CP transformation properties of Dirac spinors and 
their bilinear covariants. A CP inversion is the direct product of parity operation 
and charge conjugation. Then, from the transformation rules described above, we 
obtain the following rules for Dirac spinors, 

f ip(x, t) -> f CP = 70 ip c (-x, t) = 70 Ctp T (-x, t), , 

I ip(x,t) -t tp CP (x,t) = ip c (-x,t )70 = rp T (~x,t)C-f o, 


and furthermore, combining (D.ll) and (D.33), similar rules are derived for the left- 
and right-handed spinors as 


( tpL <R {x,t) ->■ (iP cp )l,r ='yo(tp c )_ R:L (~x,t) = 'y 0 Cipl M (-x,t), 
l il> L ' R (x,t) (lp CP ) L< R(x> t ) = {' 1 ! }C )r,l{—X, t)lo = ipi >R {-x,t)Cj 0 . 

By using these transformation rules, we can obtain the CP transformation of the 
weak left-handed current as 


Jp(ab) — 1p a L{ x it)'Yn' l PbL{x,t) 


th pT $bL 

-4>bL(-X, t)^ll) aL (-X, t) 

-J ( M ba)(-*,t), (D.36) 


where the minus sign in the last two equations originates from the interchange of 
fermion fields in taking the transposition. 

Let us consider the transformation of the charged current weak interaction under 
CP inversion. The Lagrangian is given in (4.164) as 


4 Q c = ~^U' l ^VD' l W+ + -j=D' L V''fU' L W-, (D.37) 


where V is the unitary matrix, being the Cabibbo-Kobayashi-Maskawa(CKM) 
matrix for the case of 3 generations, and is the charged weak boson fields 
defined by (4.55). If we write the charged currents as J M ~ = U' L y l VD' L and 
J M+ = D' L 'y fl VW' L , we obtain the CP transformed currents as 


r (j^f p = -D' Llll v T u' L , 

\ (Jm+)CP = -U' Llll VD‘ L . 


(D.38) 
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Furthermore, the application of C and P inversions to the vecor field yields 

W± Wjf p± = -W 1 *. (D.39) 

Then, we get the following transformation, 

I d i x(U' L ^VD' L W+ + D , L 'fV*U , L W~) 

-> j d i x{D' Llll V T U' L W li - +U' L ^V*D' l W p+ ). (D.40) 

Therefore, if V = V*, i.e. V is real, the action is invariant under CP inversion, 
that is, CP is conserved. On the contrary, if V ^V*, i.e. V is not real, then CP is 
violated. 
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Appendix E 


The quark model 


In modern particle physics, the symmetry and the constituent of matter having its 
symmetry are powerful tools to understand the structure of matter and its physics. 
One such example is the proton and neutron which are the constituents of all nuclei 
and possess the SU( 2) symmetry of isospin. Various properties of nuclei are well 
understood by the dynamics of protons and neutrons having the SU (2) symmetry 
of isospin. Now, it is known that there are more than 300 hadrons, including the 
proton and the neutron. Then, it is natural to expect that there must be new 
fundamental constituents at a deeper level in Nature which build those hadrons 
systematically based on new underlying symmetries. In 1964, Gell-Mann and Zweig 
(Gell-Mann, 1964; Zweig, 1964) proposed such a new constituent model of hadrons 
called the quark model, in which the hadrons are beautifully classified with the 
SU (3) symmetry. The idea of quarks as the fundamental constituents of hadrons 
has been developed with a lot of theoretical and experimental efforts in 1960s and 
1970s, and the quark model was established as an important component of the 
standard model of modern particle physics. Here we would like to discuss the quark 
model. 


E.l Isospin symmetry 


Before discussing the quark model, let us begin by briefly reviewing the isospin 
symmetry. Nuclei are bound states of protons and neutrons and their nuclear force 
is issued from pion exchange. Interestingly the nuclear force does not distinguish 
the proton and the neutron. The proton mass m p = 938.27200 ± 0.00004 MeV is 
very close to the neutron mass m n = 939.56533 ± 0.00004 MeV and their small 
mass difference is considered to be due to the electromagnetic interaction; a proton 
has a positive charge, while a neutron is neutral. This situation naturally suggests 
that the proton and the neutron are two manifestations of the same particle, called 


the nucleon, and one can write it as a doublet 2 



of the fundamental 
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representation of the SU (2) group, that is, the proton and neutron are represented 
in the isospin space as 

' p) =(i) ■ < /S ='">=(?) ’ (,s < e -» 

which are just the same representation as the spin-up and spin-down states of spin 
| particles, respectively. Similarly, the masses of tt + , 7t° and tt~ are very close and 
they are considered to be three manifestations of the same entity, the pion. This 
suggests that these particles are degenerate without electromagnetic interactions 
and the isospin symmetry is a good symmetry under strong interactions. In fact, 
nuclear interactions between nucleons and pions have an SU( 2) isospin symmetry 
which works very well. The isospin symmetry is very effective to understand the 
structure of nuclei and nuclear interaction dynamics. 

Mathematically the isospin symmetry is just the copy of the spin symmetry; the 
generators P = y (i = 1,2,3) satisfy the same SU (2) algebra as the ones of the 
spin 

[I\P] =i£ ijk I k . (E.2) 


where is the totally antisymmetric Levi-Civita symbol with e i2 3 = 1 and the 
summation over k is implied, r 1 have the same expression as the Pauli matrices 
o x (i — 1,2,3) and their explicit expression are presented in (C.3). 

For the two-nucleon system, analogously to the spin composition of two spin \, 
one can construct the I = 1 (triplet) and I = 0 (singlet) states of the nucleon-nucleon 
system, 


I = 1 (triplet) 
I = 0 (singlet) 


1 1 = 1,13 = 1 ) =PP, 

1 1 = 1, h = 0) = ^(pn -I- np), 
\I = 1 ,/ 3 = -1) = nn, 


\I = 0, J 3 = 0) = 


-j={pn-np). 


(E.3) 

(E.4) 


As can be seen here, I = 1 (triplet) and I = 0(singlet) states are symmetric and 
antisymmetric, respectively, for interchange of the 1st and 2nd nucleon in the two- 
nucleon system. One can illustrate this situation in terms of the so-called Young 
tableaux, in which a doublet state 2, i.e. the fundamental representation of SU(2) 
is represented as □ = 2. 

= El©™* ( E - 5 ) 

2 <g> 2 = 1®3 


where 0 = 1 (singlet) and □□ = 3(triplet) represent the antisymmetric and sym¬ 
metric states, respectively. The number below the Young tableaux of (E.5) denotes 
the dimension of each irreducible representation. 
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One should be careful in constructing multiplets of nucleon-antinucleon systems. 
To see this, consider a rotation with n around the 2nd axis in an isospin frame, as 
an example of isospin transformation of a nucleon doublet, 

Now, let us define the charge conjugation operator C which transforms particles 
into antiparticles, 


Cjp) = |p) and C|n) = n), 


(E.7) 


where the charge of p and n is -1 and 0, respectively. Applying C to (E.6) leads to 


(U)-0 o)(i0- 


(E.8) 


However, the sign of the eigenvalue of / 3 for antiparticles is opposite to the one for 
particles and we want to arrange n to the upper component (I 3 = +|) and p to 

the lower component (7 3 = — |) in the antiparticle doublet, as 2’ = ^ n _ ^. Then, 

if we introduce a minus sign to n, we can see that it transforms in exactly the same 

way as the the nucleon doublet ^ P ^ under the SU (2) isospin transformation, as 


shown 


(7)-(! ‘.‘X? 


(E.9) 


(E.9) is certainly the same transformation as (E.6). Namely, 2* and 2 transform 
in the same way under the SU(2) transformation, i.e. 2* ~ 2. (Here don’t worry 
about the nasty minus sign to h.) This can be done only for the SU( 2) symmetry 
as proved in Appendix C. 

Now, we can construct the isospin states of the nucleon-antinucleon system as 
follows; 


I 

I 


1(triplet) | 
0 (singlet) 


1 1 = l,/ 3 = 1) = -pn, 

1 1 = l,/ 3 = 0) = ^(pp - nn), 
\I = 1 ,h = -1) = rap, 


\I = 0, 1 3 = 0) = —|(pp + ran). 


(E.10) 

(E.ll) 


E.2 Quark model and SU(3) symmetry 

In 1950s, new hadrons were discovered. Those new hadrons were surprisingly long- 
lived compared to the strong interaction scale. For example, the A 0 and K° are 
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easily produced in high energy 7r p scattering but those particles decay into light 
hadrons with very long lifetime, 

tT + p^A° + K°, (E.12) 

A° -> p+TT~, K° -> TT + TT~. (E.13) 

To explain the fact that while the production of A 0 and K° occurs with strong inter¬ 
action scale, the decay of those particles does with weak interaction scale, Nakano 
and Nishijima and, independently, Gell-Mann introduced a new additive quantum 
number called “strangeness” (5). They assigned S as 5 = 0 for p, n~, S = +1 
for K° and S = —1 for A 0 , and considered that while the strong interaction con¬ 
serves the quantum number 5, the weak interaction does not. In the production 
process (E.12), S is conserved but in the decay processes (E.13), it is not. Soon 
later, the idea of Nakano, Nishijima and Gell-Mann was confirmed from observed 
properties of many strange particles discovered those days. The conservation of the 
strangeness S is similar to the one of the charge Q due to the 17(1) electromag¬ 
netic symmetry. This suggests the existence of a new U( 1) symmetry. Actually 
one can introduce the symmetry called the 17(1) hypercharge symmetry, where the 
new quantum number Y, called hypercharge, is defined by the sum of the baryon 
number B and the strangeness S,Y = B + S. Because of this new 17(1) symmetry, 
the strong interaction conserves the hypercharge Y and hence, the strangeness 5 
is also conserved in strong interactions, because the baryon number B is a good 
quantum number for the strong interaction. Then, we can see that the following 
relation, being called the Nakano-Nishijima-Gell-Mann (NNG) relation (Nakano 
and Nishijima, 1953; Gell-Mann, 1953), 

Q = I 3 + j, (E.14) 

works well for all hadrons discovered those days, where Q and J 3 are the charge 
and the 3rd component of the isospin of the hadron, respectively. 

In 1964, Gell-Mann and Zweig introduced the quarks as physical substances 
to realize the relation (E.14). In the quark model, all hadrons are made of a few 
quarks; while all baryons are made of 3 quarks q, all mesons are made of a quark q 
and an antiquark q , where all quantum numbers of q is opposite to those of q. Since 
the quark model should make even strange hadrons like A 0 and K°, we need a new 
quark, i.e. the s(strange) quark in addition to the u(up) and d(down) quarks which 
nicely build the non-strange hadrons like p, n, ir, etc. Thus, in the original quark 
model the u, d and s quarks were considered to be the fundamental constituents of 
hadrons and to have the SU (3) symmetry. Note that though the SU (2) symmetry of 
isospin is a rather good symmetry due to almost equal masses of p and n, the SU (3) 
symmetry is not such a good symmetry because mass differences of strange hadrons 
and non-strange hadrons are rather big. Later, the existence of more heavier quarks 
c(charm), 6(bottom) and f(top) quarks were also established. Now, we have 6 


T/faf/tcwtaf iia/ 



Quark model and SU(3) symmetry 


285 



I 

I 3 

5 

B 

Y 

Q 

u 

1 

2 

+ 2 

0 

1 

3 

1 

3 

'3 

d 

1 

1 

0 

1 

1 

1 

2 

2 

3 

3 

3 

s 

0 

0 

-1 

1 

3 

2 

3 

1 

3 


Table E.l Quantum numbers of u,d,s quarks. 


different kinds of quarks qi (i = u,d,s,c,b,t) and these degree of freedom is called 
“flavor”, which is unrelated to another degree of freedom “color”, being the strong 
interaction charge which plays an important role in quantum chromodynamics, the 
field theory of strong interactions, being discussed in Chapter 5. The quantum 
numbers of u, d, s quarks are given in Table E.l. By taking account of these 
quantum numbers of quarks, we can make many hadrons from those quarks, like 
p = ( uud ), n = ( udd ), A 0 = ( uds ), tt + = { ud ), 7r _ = (ud ), K° = (ds), etc. 

In the SU (3) symmetric world,* the fundamental representation of the quark is 
given by the triplet (=3) 


9 = 


l:) 


\ * ) 


(E.15) 


Now, let us consider the 51/(3) transformation of this triplet, 

Qa-^q'a = U{6) b a q b , ( a,b=u,d,s) (E.16) 

U(0) = e~ iei t-, 

where 6 l ( i = 1 , 2 , • ■ •, 8) are parameters of this group and ({ — 1 , 2 , • • •, 8) are 
the generators of the SU( 3) group. Here the summation over i is implied. Explicit 
expression of A 1 is given by (C.4). These generators satisfy the Lie algebra; 

\k 

= (E.17) 

where fijk is called the structure constant and anti-symmetric under any exchange 
of i, j and k. Here the summation over k is implied. The group 517(3) includes 
the SU( 2) of isospin and the 17(1) of hypercharge as subgroups and hence it is the 
group of rank 2. It has 2 diagonal matrices, A 3 and A 8 . It is convenient to define 
the isospin and hypercharge operators as I 3 = A. and F = A_, respectively. Then, 


V A^ 
2 ’ 2 


'Here we consider an idealized world of equal quark masses of u, d and s, though they are, in fact, 
different. Therefore, the same mathematics can be applied for the color SU(3), which is an exact 
symmetry. 
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Y Y 




Fig. E.l Weight diagram for q = 3 and q = 3*. 


we can plot the members of the quark triplet with their quantum numbers of I 3 
and Y in (/ 3 , Y) space, as shown in Fig. E.l(a). The members of the antiquark 
triplet (=3*) are also ploted in Fig. E.l(b). (Fig. E.l is called the weight diagram.) 
Introducing I 3 and Y into (E.14), we obtain the charge operator, 


Q = I 3 


+ 2 ~ 


X 3 X s 


( I 


2\/3 


0 


(E.18) 


V 0 0 -w 


The diagonal elements of Q are certainly equal to the charge of the it, d and s 
quarks given in Table E.l. Then we see that the NNG relation works well even for 
quarks. 


E.3 Representations of mesons and baryons 

In the quark model, mesons are composed of qq, while baryons are of qqq. The 
fundamental representation q — 3 transforms as (E.16) under U, while the conjugate 
representation q — 3* transforms as 

q a -» q' a = q b (U') a b (E.19) 

under W, where U is unitary, t/+ = U~ l . (see (C.16)) Therefore, it is interesting 
to see the product of representations to discuss the spectroscopy of mesons and 
baryons. In general, the product of representations is not irreducible. If some rere- 
sentations of the group can be decomposed into a direct sum of other independent 
representations, they are called “reducible” and if not, they are called “irreducible”. 
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First consider the product q a q b — 3 <g> 3*, which is reducible. When we decom¬ 
pose it as 

q a q b = ( q a q b ~ 1 5 b a q c q c ) + ^5 b a q c q c , (E.20) 

we see that the 1st term is traceless because of Tr(g a g 6 ) = q a q a and has 8 compo¬ 
nents. The 2nd term is just the trace term and of only one component. The 1st 
and 2nd terms do not mix each other and hence they are irreducible. Then we can 
say that 3 0 3* can be resolved into the octet 8 and singlet 1 as 

3 ® 3* = 8 ® 1. (E.21) 

Next, consider the direct product q a qb = 3 <g> 3. It is also reducible. To resolve 
this reducible representation into two irreducible representations, let us separate it 
into the symmetric and antisymmetric parts as 

q a qb = q a qb + + ~(q a qb - qbq a ), (E. 22 ) 

where the 1 st and 2 nd terms are symmetric and antisymmetric, respectively, for 
interchange of q a and q>,. The antisymmetric part has 3 components and identical 
to the set of objects £ a 6 c<Mc 5 where e a bc is the totally antisymmetric Levi-Civita 
symbol with £123 = 1. Then if we define £ a f>c<Mc = q a , we can see that q a q a is 
invariant under the transformation of U as follows; 

q' a q' a = £abcq' a qWc = £abcUaUjfuz'q a >qb>q C ' 

— £ a'b'c ' det Uq a iqb'q c ' ~ E-a'b'c'qa'qb'qc’ 

= q a 'q a '=q a q a , (E. 23 ) 

where det U = 1. Since U is unitary, from the transformation rule of (E.16) for q a 
and the result of (E.23), we can easily find that q a , i.e. the antisymmetric part of 
(E.22), transforms as the conjugate representation 3*. The symmetric part has 6 
components and thus, we can obtain the decomposition rule, 

3 ® 3 = 3* ® 6. (E.24) 

3* and 6 do not mix under the SU( 3) transformation and each one cannot be 
decomposed any more. They are irreducible. 

Finally, what happens when we multiply one more quark state to (E.24)? In this 
case, we have two products 3* ® 3 and 6 <g> 3. The product 3* ig> 3 has 9 components 
and is decomposed into the octet 8 and singlet 1 as presented in (E.21). But here 
we should be careful about that in 1 all quarks are totally antisymmetric (as can be 
seen from q a q a in (E.23)) and in 8 the first 2 quarks are antisymmetric. The product 
6 ® 3 has 18 components and is decomposed into the decuplet 10 and octet 8. In 
10 all quarks are totally symmetric, while in 8 only first 2 quarks are symmetric. 
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K~ K° 

(b) 0“ octet (8) (c) 0“ singlet (1) 

Fig. E.2 Mesons in the quark model. 

Thus, we can summarize the result of the decomposition of qqq states as 

3 <g> 3 <g> 3 = (3* © 6)<g>3 = (3* <g> 3) © (6 <g> 3) 

= 1 © 8yiffi8s®10, (E.25) 

where the first 2 quarks are antisymmetric and symmetric in 8^ and 85, respectively. 
Likewise in the SU(2) case, we can illustrate the quark triplet (E.15) as □ = 3. 
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In the Young tableaux, 0 represents the antisymmetric combination of 2 parti¬ 
cles. Since the antisymmetric part of (E.22) being identical to the set of objects 
£ofcc969c = Q a has 3 components, we illustrate it as 0 = 3*, which is the conjugate 
representation corresponding to the antiquark q. Moreover, □□ represents the sym¬ 
metric combination of 2 particles and thus, the symmetric part of (E.22) having 6 
components are illustrated as □□ = 6. 

• Mesons 

In the quark model, mesons are made up from a quark q and an antiquark q. Here 
we consider an example of pseudoscalar mesons with J p = 0“. 

qq = 3 <g> 3* = 1 ® 8. (E.26) 

Or using Young tableaux, one can write this as 

= =©EP. (E.27) 

3 ® 3* = 10 8 

The number below the Young tableaux denotes the dimension of each irreducible 
representation. Therefore, mesons are represented by a sum of the octet 8 and the 
singlet 1. By combining the weight diagrams of q (Fig. E.l(a)) and q (Fig. E.l(b)), 
we can show this in a different way as shown in Fig. E.2. The same configuration 
is obtained even for vector mesons with J p = 1~. 

• Baryons 

Baryons are composed of three quarks qqq and its product of representations are 
given in (E.25). The Young tableaux for this configuration is given as 

qqq = D®D®D= (00m) ®D= I © EP © 0- 1 © EIX1 (E.28) 

3 <g> 3 ® 3 = (3* © 6) ® 3 = 1 © 8 © 8 © 10 

Similarly to mesons, by combining the weight diagram of Fig. E.l(a), we can 
represent the baryons as in Fig. E.3. 
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